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HUMAN ENDOMETRIAL SPECIFIC 
STEROID -BINDING FACTOR I, II and III 

This invention relates, in part, to newly identified 
polynucleotides and polypeptides; variants and derivatives of the 
polynucleotides and polypeptides; processes for making the 
polynucleotides and the polypeptides, and their variants and 
derivatives; agonists and antagonists of the polypeptides; and uses 
of the polynucleotides, polypeptides, variants, derivatives, 
agonists and antagonists. In particular, in these and in other 
regards, the invention relates to polynucleotides and polypeptides 
of human endometrial specific steroid-binding factor I, II and III, 
sometimes hereinafter referred to as "hBSF I, II and III". 

BACKGROUND OF THE INVENTION 

The regulation of cells and tissues is controlled by autocrine 
and paracrine factors, such as systemic hormones and factors that 
modulate or mediate the action of hormones. 

Many peptides, expressed locally, can influence certain 
biological activity in the mairanalian system and are very important 
in the regulation of cells of the epithelium. These factors 
largely have not been identified or characterized, particularly not 
in humans. 

A few factors that play a role in the regulation of functions 
of the lung and uterus, both adult and fetal, have been identified 
in non-human organisms. One such factor is mammalian CC10, i.e., 
human, rat and rabbit CC10. (Wolf, M. et al . , Human Molecular 
Genetics . l(6):37l-378 (1992)). Clara Cell 10 kDa secretory 
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'protein (CC10) which is also called Clara Cell 17 kDa protein, is 
a homociimer consisting of 8.5 kDa monomers that are joined by two 
disulfide oonds (Umland, T.C. ec al . , J. Mol . Biol . . 224:441-448 
(1992)). It is the predominant secreted protein of lung Clara 
cells which are the lining of the bronchiolar epithelium (Singh, 
G. and Katyal, S.L., J. Histochem. Cvtochem. , 32:49-54 (1984)). 
The physiological role of the protein is not yet completely 
understood. It has been reported that CC10 specifically binds 
methylsulf onyl-polychlorated biphenyls (PCBs) (Nordlund, Moler, L. 
et al., J. Biol. Chem. , 265:12690-12693 (1990)) and inhibits 
phospholipase A 2 (Singh, G. et al., Biochem. Biophvs. Acta . 
1039:348-355 (1990)). In the last few years the sequences of rat 
(Katyal, S.L. et al. t Proa. Respir. Res. . 25:29-35 (1990); and 
Hagen, G. et a J . , Nucleic Acids Res. , 18:2939-2946 (1990) ) , and 
human (Singh, G. et al. # Biochem. Biophys. Acta , 950:329-337 (1988) 
CC10 cDNAs have been reported. cDNAs, and the derived amino acid 
sequences,, show striking homologies to rat uteroglobin (Singh, G. 
et al., Biochem. Biophvs. Acta . 1039:348-355 (1990") ; and Hagen, G. 
et al.. Nucleic Acids Res. . 18:2939-2946 (1990)). 

Like CC10, rat uteroglobin is a covalently bound homodiraer 
whose three dimensional structure is well known (Morize, I. et al., 
J . Mol . Biol . . 194:725-739 (1987). Uteroglobin expression in 
rabbits hac been originally reported in the uterus during the 
pre implantation phase (Beier, H.M., Biochem. Biophvs. Acta . 
160:289-290 (1968)) . More recently, the protein was also detected 
in oviduct (Kirchner, C. , Cell Tissue Res. . 170:490-492 (1976)), 
male genital organs (Beier, H.M. et al . , Cell Tissue Res. , 165:1-11 
(1975)), esophagus (Noske. I.G. and Peigelson, M. , Biol . Reprod. . 
15:704-713 (1976)) and lung (Noske, supra; and Torkkeli, T. etal., 
Biochem. Biophvs. Acta . 544:578-592 (1978)). 

in vitro, several distinct properties of uteroglobin have been 
described. Soon after its discovery it could be shown that the 
steroid hormone progesterone is specifically bound by the protein 
(Beato, M. and Baier. R. f Biochem. Biophvs. Acta , 392:346-356 
(1975); and Beato, M. et al . , J. Steriod Biochem. , 8:725-730 
(1977)). Therefore, rabbit uteroglobin was believed to be a 
potential carrier or scavenger of progesterone that regulates the 
progesterone concentration in the endometrium (Atger, M. et al., 
J. Steroid Biochem. , 13:1157-1162 (1980)). It has also been shown 
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to specifically bind certain methylsulf onyl metabolites of 
polychlorinated biphenyls with even higher affinity than 
progesterone (Gillner, M. et al . . J. Steroid Biochem. , 31:27-33 
(1988)). Furthermore, uteroglobin has been found to inhibit 
phospholipase A : . The relationships of all these properties and 
their physiological significance is still not understood and 
remains largely a matter of speculation. 

The rat CC10 mRNA is expressed like rat uteroglobin not only 
in lung but also in the esophagus as well in uteri of estrogen and 
progesterone treated female rats (Hagen, G. 1990 supra) suggesting 
that rat CC10 is the rat counterpart of rat uteroglobin (see in 
general Wolf, M . et al., Human Molecular Genetics , l(6):37l-378 
(1992)). 

Human CC10 expression is abundant in non-neoplastic human 
lung, and it is detectable in tumors in corresponding cell lines 
at markedly lower levels (Broers, J.L.V. et al., Lab . Invest . , 
66:337-346 (1992); Linnoila, R.I. etal., Amer.l J. Clin. Pathol 
90:1-12 (1988)). CC10 levels were also significantly lower in 
serum and bronchoalveolar lavage specimens obtained from smokers 
and lung cancer patients compared with specimens from healthy non- 
smokers (Bernard, A. et al Europ. Resp. J. , 5 : 1231-1238 (1992) ) . 

These findings suggest the expression of CC10 mRNA becomes 
altered in distinct lung compartments and may implicate a role for 
CC10 in the development of pulmonary carcinomas (Jensen, S.M. et 
al., Int. J. Cancer . 58:629-637 (1994). 

Some of the biological properties of UG, such as masking the 
antigenicity of blastomers (Mukherjee, A.B., et al., Med. 
Hypotheses . 6:1043-1055 (1980)) and epididytnal spermatozoa 
(Mukherjee, D.C., et al . , Science (Wash. P.O . 219:989-991 
(1983)), inhibition of monocyte and neutrophil chemo taxis and 
phagocytosis in vitro (Schiffman, E.V., et al., Agents Actions 
Supp l . . 12:106-120 (1983)), and inhibition of ADP- and thrombin- 
induced (but not of arachidonic acid- induced) platelet aggregation 
(Manjunath, R., et al . , Biochem. Pharmacol . . 36:741-746 (1987)), 
may be due, at least in part, to the potent inhibitory effect of 
this protein on PLA : activity (Levin, S.W., et al . , Life Sci . . 
38:1813-1819 (1986)1. A nonapeptide derived from the amino acid 
sequence of of-helix-3 of UG monomer (residues 39-47) possesses all 
the biological properties of the intact protein and has been 
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identified as an active site of UG responsible for its PLA : - 
inhibitory and antiinflammatory activities (Miele, L. , et a!., 
N a t-i,r» fLond.) . 335:726-730 (1988)). 

It has been indicated that cclOkD- specif ic transcripts are 
present in several nonrespiratory human organs and tissues. By 
using an antibody to rabbit UG. a UG-like itnmunoreactivity in human 
endometrium (Kikukawa, T. , et al . , ,7 nig. Endocrinol. Meta b^, 
67:315-321 (1988)), prostate (Manyak, M.J., et al., J . Uro.U , 
140:176-182 (1988)). and respiratory tract (Dhanireddy, R. , et al . , 
a ^H» m Hior-Hyc commun.. 152:1447-1454 (1988)). has been 

described. 

Recently, the cDNA (Singh. G., etal., Biochem. Biophys. Acta , 
9S0: 329-337 (1988)) and the 5' regions (Wolf, M. , et al., Human 
Mo1 Genet . , 1:371-378 (1992)) of the gene encoding human 
uteroglobin (HOB), a counterpart of rabbit UG (rUG) . has been 
characterized. Human UG or Clara cell 10-kD protein has 61.5% amino 
acid sequence identity with rUG (Singh, G. . et al., BifiShjm,. 
n 1 » r hy R Acta . 950:329-337 (1988)), S4.2% similarity with rat UG 
(Singh. G. , et al., ■1r-t™» ««»°hv. Acta. 1039:348-355 (1990)), 
and 52.8* with mouse UG (Singh, G.. et al., BXP , ^nq W. * ^9:61- 
75 (1993) ) . Although this protein was originally discovered in the 
alveolar Clara cells (Singh. G. , et al., ,T H^ochem, , 36:73-80 
(1988)) it is detectable in many extrapulmonary tissues similar to 
the ones in which rUG is expressed (Peri, A., et al., BH&_Cell 
Biol 5:495-503 (1994)) and this expression is induced by 
progesterone. It appear that some of the biological properties of 
hUG are virtually identical to rUG (Mantile, G.. et al., Q. Biol. 
Chem. . 27:20343-20351 (1993)). 

It has been reported that UG in the rabbit uterine fluid is 
first detectable on day 3 of pregnancy, and peak level is reached 
on day 5 (for a review see Miele. L.. et al.. Endocr. Rev. ,, 8:474- 
490 (1987) ) . UG. by inhibiting PLA : activity, may down-regulate the 
production of proinflammatory lipid mediators, which promote 
contraction and motility of the uterine smooth muscle. Therefore, 
it is suggested that UG facilitates the maintenance of myometrial 
quiescence during gestation. 

There is a clear need in the art to further isolate and 
characterize proteins which are homologues of mammalian Clara cell 
10 kDa secretory protein and rat prostatic steroid-binding protein. 
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The genes and gene produces of the present invention display 
homology to the rat prostatic steroid-binding protein and Clara 
cell 10 kDa secretory protein. 

SUMMARY OP THE INVENTION 

Toward these encN. and others, it is an object of the present 
invention to provide polypeptides, inter alia, that have been 
identified as novel hESF I, II and III by homology between the 
amino acid sequence set out in Figures 1, 2 and 3 (SEQ ID NO:2, 4 
and 6) and known amino acid sequences of other proteins such as rat 
prostatic steroid-binding protein. 

It is a further object of the invention, moreover, to provide 
polynucleotides that encode hESF I, II and III, particularly 
polynucleotides that encode the polypeptides herein designated hESF 
I, II and III. 

In a particularly preferred embodiment of this aspect of the 
invention the polynucleotides comprise the regions encoding human 
hESF I, II and III in the sequence set out in Figures 1, 2 and 3 
(SEQ ID NO:2, 4 and 6) . 

In accordance with this aspect of the present invention there 
is provided isolated nucleic acid molecules encoding mature 
polypeptides expressed by the human cDNA contained in ATCC Deposit 
No. 97401 !ESF I), 97402 (ESF II) and 97403 (BSF III). 

In accordance with this aspect of the invention there are 
provided isolated nucleic acid molecules encoding human hESF I, II 
and III, including mRNAs, cDNAs , genomic DNAs and, in further 
embodiments of this aspect of the invention, biologically, 
diagnostically, clinically or therapeutically useful variants, 
analogs or derivatives thereof, or fragments thereof, including 
fragments of the variants, analogs and derivatives. 

Among the particularly preferred embodiments of this aspect 
of the invention are naturally occurring allelic variants of human 
hESF I, II and III. 

It also is an object of the invention to provide hESF I, II 
and III polypeptides, particularly human hESF I, II and III 
polypeptides, that treat and/or prevent inflammation, asthma, 
rhinitis, cystic fibrosis, airway disease, nieoplasia, atopy, 
inhibit phospholipase A, activity, bind polychlorinated biphenyls, 
reduce foreign protein antigenicity, inhibit monocyte and 
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neutrophil chemotaxis and phagocytosis. inhibit platelet 
aggregation, regulate eicosanoid levels in the human uterus and 
control the growth of endometrial cells. 

in accordance with this aspect of the invention there are 
provided novel polypeptides of human origin referred to herein as 
hBSF I. II and ,111 as well as biologically, diagnostically or 
therapeutically useful fragments, variants and derivatives thereof, 
variants and derivatives of the fragments, and analogs of the 
foregoing. 

Among the particularly preferred embodiments of this aspect 
of the invention are variants of human hBSF I. II and III encoded 
by naturally occurring alleles of the human hBSF I. II and III 
genes . 

It is another object of the invention to provide a process for 
producing the aforementioned polypeptides, polypeptide fragments, 
variants and derivatives, fragments of the variants and 
derivatives, and analogs of the foregoing. In a preferred 
embodiment of this aspect of the invention there are provided 
methods for producing the aforementioned hBSF I. II and III 
polypeptides comprising culturing host cells having expressibly 
incorporated therein an exogenously-derived human hBSF I. II or III 
encoding polynucleotide under conditions for expression of human 
hESF I. II and III in the host and then recovering the expressed 
polypeptides. 

In accordance with another object of the invention there are 
provided products, compositions, processes and methods that utilize 
the aforementioned polypeptides and polynucleotides for research, 
biological, clinical and therapeutic purposes, inter alia. 

in accordance with certain preferred embodiments of this 
aspect of the invention, there are provided products, compositions 
and methods, inter alia. for. among other things: assessing hBSF 
I II and III expression in cells by determining hBSF I. II and III 
polypeptides or hBSF I, II and Ill-encoding mRNA; expressing hBSF 

I II and III in vitro, ex vivo or in vivo by exposing cells to 
hESF I. II and III polypeptides or polynucleotides as disclosed 
herein; assaying genetic variation and aberrations, such as 
defects, in hESF I. II and III genes; and administering a hBSF I, 

II and III polypeptide or polynucleotide to an organism to augment 
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hESF I, II and III f miction or remediate hESF I, II and III 
dysfunction. 

In accordance with certain preferred embodiments of this and 
other aspects of the invention there are provided probes that 
hybridize to human hESF I, 11 and III sequences. 

In certain additional preferred embodiments of this aspect of 
the invention there are provided antibodies against hESF I, II and 
III polypeptides. In certain particularly preferred embodiments 
in this regard, the antibodies are highly selective for human hESF 
I, II and III. 

In accordance with another aspect of the present invention, 
there are provided hESF I, II and III agonists. Among preferred 
agonists are molecules that mimic hESF I, II and III, that bind to 
> hESF I, II and III -binding molecules or receptor molecules, and 
that elicit or augment hESF I, II and Ill-induced responses. Also 
among preferred agonists are molecules that interact with hESF I, 

II and III polypeptides, or with other modulators^ of hESF I, II and 

III activities, and thereby potentiate or augment an effect (s) of 
hESF I, II and III. 

In accordance with yet another aspect of the present 
invention, there are provided hESF I, II and III antagonists. 
Among preferred antagonists are those which mimic hESF I, II and 
III so as to bind to hESF I, II and III receptors or binding 
molecules but not elicit a hESF I, II and III -induced response or 
more than one hESF I, II and I II -induced response or which prevent 
expression of hESF I, II and III. Also among preferred antagonists 
are molecules that bind to or interact with hESF I, II and III so 
as to inhibit an effect (s) of hESF I, II and III. 

The agonists and antagonists may be used to mimic, augment or 
inhibit the action of hESF I, II and III polypeptides. They may 
be used, for instance, to treat and/or prevent an inherited 
susceptibility to asthma. 

In a further aspect of the invention there are provided 
compositions comprising a hESF I, II or III polynucleotide or a 
hESF I, II or III polypeptide for administration to cells in vitro, 
to cells ex vivo and to cells in vivo, or to a multicellular 
organism. In certain particularly preferred embodiments of this 
aspect of the invention, the compositions comprise a hESF I, II or 
III polynucleotide for expression of a hESF I, II or III 
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polypepuide in a host organism for treatment of disease. 
Particularly preferred in this regard is expression in a human 
patient for treatment of a dysfun-ion associated with aberrant 

endogenous activity. 

Other objects, features, advantages and aspects of the present 
invention will become apparent to those of skill from the following 
description. It should be understood, however, that the following 
description and the specific examples, while indicating preferred 
embodiments of the invention, are given by way of illustration 
only various changes and modifications within the spirit and 
scope of the disclosed invention will become readily apparent to 
those skilled in the art from reading the following description and 
from reading the other parts of the present disclosure. 

BRIEF DESCRIPTION OP THE DRAWINGS 

The following drawings depict certain embodiments of the 
invention. They are illustrative only and do not limit the 
invention otherwise disclosed herein. 

Figure 1 shows the nucleotide and deduced amino acid sequence 

of human hBSF I. ^ 

Figure 2 shows the nucleotide and deduced amino acid sequence 

of human hESF II. 

Figure 3 shows the nucleotide and deduced amino acid sequence 

of human hESF III. 

Figure 4 shows the regions of similarity between amino acid 
sequences of hESF I and rat prostatic steroid-binding protein 

polypeptides (SEQ ID NO:25) . 

Figure 5 shows the regions of similarity between amino acid 
sequences of hESF II and rat prostatic steroid-binding protein 

polypeptides (SEQ ID NO: 26) . 

Figure 6 shows the regions of similarity between amino acid 
sequences of hESF III and rat prostatic steroid-binding protein 
polypeptides (SEQ ID NO: 27). 

Figure 7 shows structural and functional features of hESF I 
deduced by the indicated techniques, as a function of amino acid 
sequence . 
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Figure 8 shows structural and functional features of hBSP II 
deduced by the indicated techniques, as a function of amino acid 
sequence . 

Figure 9 shows structural and functional features of hESF III 
deduced by the indicated techniques, as a function of amino acid 
sequence . 

GLOSSARY 

The following illustrative explanations are provided to 
facilitate understanding of certain terms used frequently herein, 
particularly in the examples. The explanations are provided as a 
convenience and are not limitative of the invention. 

DIGESTION of DNA refers to catalytic cleavage of the DNA with 
a restriction enzyme that acts only at certain sequences in the 
DNA. The various restriction enzymes referred to herein are 
commercially available and their reaction conditions, cof actors and 
other requirements for use are known and routine to the skilled 
artisan. 

For analytical purposes, typically, l pig of plascnid or DNA 
fragment is digested with about 2 units of enzyme in about 20 pi 
of reaction buffer. For the purpose of isolating DNA fragments for 
plasmid construction, typically 5 to 50 ng of DNA are digested with 
20 to 250 units of enzyme in proportionately larger volumes. 

Appropriate buffers and substrate amounts for particular 
restriction enzymes are described in standard laboratory manuals, 
such as those referenced below, and they are specified by 
commercial suppliers . 

Incubation times of about 1 hour at 37 *C are ordinarily used, 
but conditions may vary in accordance with standard procedures, the 
supplier's instructions and the particulars of the reaction. After 
digestion, reactions may be analyzed, and fragments may be purified 
by electrophoresis through an agarose or polyacrylamide gel, using 
well known methods that are routine for those skilled in the art. 

GENETIC ELEMENT generally means a polynucleotide comprising 
a region that encodes a polypeptide or a region that regulates 
transcription or translation or other processes important to 
expression of the polypeptide in a host cell, or a polynucleotide 
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•comprising both a region that encodes a polypeptide and a region 
operably. linked r. hereto that regulates expression. 

Genetic elements may be comprised within a vector that 
replicates as an episomal element; that is, as a molecule 
physically independent of the host cell genome. They may be 
comprised within mini -chromosomes, such as those that arise during 
amplification of transacted DMA by methotrexate selection m 
eukaryotic cells. Genetic elements also may be comprised within 
a host cell genome; not in their natural state but, rather, 
following manipulation such as isolation, cloning and introduction 
into a host cell in the form of purified DMA or in a vector, among 

° ther iSOLATBD means altered "by the hand of man" from its natural 
state; i.e., that, if it occurs in nature, it has been changed or 
removed from its original environment, or both. 

For example, a naturally occurring polynucleotide or a 
polypeptide naturally present in a living animal in its natural 
state is not "isolated," but the same polynucleotide or polypeptide 
separated from the coexisting materials of its natural state is 
-isolated-, as the term is employed herein. For example, with 
respect to polynucleotides, the term isolated means that it is 
separated from the chromosome and cell in which it naturally 

° CCUr As part of or following isolation, such polynucleotides can 
be joined to other polynucleotides, such as DMAs, for 
co form fusion proteins, and for propagation or expression in a 
host, for instance. The isolated polynucleotides, alone or joined 
to other polynucleotides such as vectors, can be -tr^uced ,nto 
host cells in culture or in whole organisms. Introduced into host 
cells L culture or in whole organisms, such DMAs still would be 
isolated, as the term is used herein, because they wouU i not ^be in 
their naturally occurring form or environment,. Similarly, the 
polynucleotides and polypeptides n«y occur in a 
fs a media formulations, solutions for introduction of 
polynucleotides or polypeptides, for example. ™" J* 11 *' 
Compositions or solutions for chemical or enzymatic reactions .for 
instance, which are not naturally occurring compositions and 
therein remain isolated polynucleotides or polypeptides vithin the 
meaning of that term as it is employed herein. 
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LIGATION refers to the process of forming phosphodiester bonds 
between two or more polynucleotides, which most often are double 
stranded DNAs. Techniques for ligation are well known to the art 
and protocols for ligation are described in standard laboratory 
manuals and references, such as, for instance, Sambrook et al. ( 
MOLECULAR CLONING, A LABORATORY MANUAL, 2nd Ed. ; Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, New York (1989) and Maniatis 
- et al., pg. 146, as cited below. 

OLIGONUCLEOTIDE (S) refers to relatively short polynucleotides. 
Often the term refers to single -stranded deoxyribonucleotides , but 
it can refer as well to single-or double- stranded ribonucleotides, 
RNA:DNA hybrids and double -stranded DNAs, among others. 

Oligonucleotides, such as single- stranded DNA probe 
oligonucleotides, often are synthesized by chemical methods, such 
as those implemented on automated oligonucleotide synthesizers. 
However, oligonucleotides can be made by a i variety of other 
methods, including in vitro recombinant DNA-mediated techniques and 
by expression of DNAs in cells and organisms. 

Initially, chemically synthesized DNAs typically are obtained 
without a 5' phosphate. The 5' ends of such oligonucleotides are 
not substrates for phosphodiester bond formation by ligation 
reactions that employ DNA ligases typically used to form 
recombinant DNA molecules. Where ligation of such oligonucleotides 
is desired, a phosphate can be added by standard techniques, such 
as those that employ a kinase and ATP. 

The 3» end of a chemically synthesized oligonucleotide 
generally has a free hydroxyl group and, in the presence of a 
ligase, such as T4 DNA ligase, readily will form a phosphodiester 
bond with a 5' phosphate of another polynucleotide, such as another 
oligonucleotide. As is well known, this reaction can be prevented 
selectively, where desired, by removing the 5' phosphates of the 
other polynucleotide (s) prior to ligation. 

PLASMIDS generally are designated herein by a lower case p 
preceded and/or followed by capital letters and/or numbers, in 
accordance with standard naming conventions that are familiar to 
those of skill in the art. 

Starting plasmids disclosed herein are either commercially 
available, publicly available on an unrestricted basis, or can be 
constructed from available plasmids by routine application of well 
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known, published procedure. Many plasmids and other cloning and 
expression vectors that can be used in accordance with the present 
invention are well known and readily available to those of skill 
in the art. Moreover, those of skill readily may construct any 
number of other plasmids suitable for use in the invention. The 
properties, construction and use of such plasmids. as well as other 
vectors, in the present invention will be readily apparent to those 
of skill from the present disclosure. 

POLYNUCLEOTIDE {S ) generally refers to any polyribonucleotide 
or polydeoxribonucleotide, which may be unmodified RNA or DNA or 
modified RNA or DNA. Thus, for instance, polynucleotides as used 
herein refers to, among others, single-and double- stranded DMA, DNA 
that is a mixture of single-and double -stranded regions, single- 
and double-stranded RNA, and RNA that is mixture of single- and 
double-stranded regions, hybrid molecules comprising DNA and RNA 
that may be single -stranded or, more typically, double -stranded or 
a mixture of single- and double -stranded regions. In addition, 
polynucleotide as used herein refers to triple- stranded regions 
comprising RNA or DNA or both RNA and DNA. The strands in such 
regions may be from the same molecule or from different molecules. 
The regions may include all of one or more of the molecules, but 
more typically involve only a region of some of the molecules. One 
of the molecules of a triple -helical region often is an 

oligonucleotide . 

As used herein, the term polynucleotide includes DNAs or RNAs 
as described above that contain one or more modified bases. Thus. 
DNAs or RNAs with backbones modified for stability or for other 
reasons are "polynucleotides- as that tertn is intended herein. 
Moreover. DNAs or RNAs comprising unusual bases, such as inosine, 
or modified bases, such as tritylated bases, to name just two 
examples, are polynucleotides as the term is used herein. 

It will be appreciated that a great variety of modifications 
have been made to DNA and RNA that serve many useful purposes known 
to -hose of skill in the art. The term polynucleotide as it is 
employed herein embraces such chemically, enzymatically or 
metabolically modified forms of polynucleotides, as well as the 
chemical forms of DNA and RNA characteristic of viruses and cells, 
including simple and complex cells, inter alia. 
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POLYPEPTIDES , as used herein, includes all polypeptides as 
described below. The basic structure of polypeptides is well known 
and heir been described in innumerable textbooks and other 
publications in the art. In this context, the term is used herein 
to refer to any peptide or protein comprising two or more amino 
acids joined to each other in a linear chain by peptide bonds. As 
used herein, the term refers to both short chains, which also 
commonly are referred to in the art as peptides, oligopeptides and 
oligomers, for example, and to longer chains, which generally are 
referred to in the art as proteins, of which there are many types. 

It will be appreciated that polypeptides often contain amino 
acids other than the 20 amino acids commonly referred to as the 20 
naturally occurring amino acids, and that many amino acids, 
including the terminal amino acids, may be modified in a given 
polypeptide, either by natural processes, such^ as processing and 
other post-translational modifications, but also by chemical 
modification techniques which are well known to the art. Bven the 
common modifications that occur naturally in polypeptides are too 
numerous to list exhaustively here, but they are well described in 
basic texts and in more detailed monographs, as well as in a 
voluminous research literature, and they are well known to those 
of skill in the art. Among the known modifications which may 

be present in polypeptides of the present are, to name an 
illustrative few, acetylation, acylation, ADP-ribosylation, 
amidation, covalent attachment of flavin, covalent attachment of 
a heme moiety, covalent attachment of a nucleotide or nucleotide 
derivative, covalent attachment of a lipid or lipid derivative, 
covalent attachment of phosphot idyl inositol, cross -linking, 
cyclization, disulfide bond formation, demethylation, formation of 
covalent cross -links, formation of cystine, formation of 
pyroglutamate, formylation, gatrana-carboxylation, glycosylation, GPI 
anchor formation, hydroxylation, iodination, methylation, 
myristoylation, oxidation, proteolytic processing, phosphorylation, 
prenylation, racemization, selenoylation, sulfation, transfer-RNA 
mediated addition of amino acids to proteins such as arginylation, 
and ubiquitination. 

Such modifications are well known to those of skill and have 
been described in great detail in the scientific literature. 
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Several particularly cotnmon modifications, glycosylate, lipid 
attachment. sulfation, gamma -carboxylatxon of glutamic acid 
residues, hydroxylation and ADP-nbosylation, for instance are 
described in most basic texts, such as. for instance PROTEINS - 
STRUCTURE AND MOLECULAR PROPERTIES. 2nd Ed.. T. B. Creighton. W. 
H Freeman and Company. New York (1993). Many detailed reviews are 
available on this subject, such as. for example, those provided by 
„old P Posctranslational Protein Modifications : Perspectives and 

Tweets, pgs. 1-12 in posttranslational covalek, ;««^« » 
PROTEINS. B. C. aonnson. Ed.. Academic Press New 
Seifter et al., Analysis for protein modif ications and nonprotein 
cofactors, Meth. Enzymol. 182: 626-646 (1990) and Rattan et ^, 
Protein Synthesis: Post trans lational Modifications and Aging. Ann. 
NY Acad. Sci. 663= 48-62 (1992). 

' it will be appreciated, as is well known and as noted above, 
that polypeptides are not always entirely linear. For instance 
polypIptIL may be branched as a result of ubiouitinatxon and 
Vte7ly be circular, with or without branching, generally as a 
result of posttranslation events, including natural Passing 
event and events brought about by human manipulation which do not 
Naturally Circular, branched and branched circular 
7Z^ £ * * ^-translation natural process 

-a w.r fl fi^- -piv synthetic methods, as well. 

the Pepcid, backbone, the amino acid std.-ch.in. and the amino or 
cartel termini. In fact. blockage o£ the amino or carboxyl group 
in Hll^tide. or both. by a covalent modification, i. ~ 
in na^urluy -currin, and .ynthetic polypeptide • 
modification, may he pre..nt in polypeptide, | ° £ <*; ^ 
— <™ as well For instance, the amino terminal residue or 
P^Uder ^e in .. coll. prior to ^ -g. 
almost invariably -ill be N-formylmethlonine. 

The modifications that occur in a polypeptide often .HI be 
a function of ho. it is made. For polypeptides made , by e^ressing 
a cloned gene in a host, for instance, the nature and extent ^ of the 
modification, in large part will be determined by the M cell 
posttranslational modification capacity and the 
signals present in the polypeptide amino acid sequence. For 
"ranee as is well *now„. gly cosylation often doe. not occur in 
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bacterial hosts such as E. coli. Accordingly, when glycosylation 
is desired, a polypeptide should be expressed in a glycosylating 
host, generally a tu/caryocic cell, insect cell often carry out the 
same posttranslational glycosylations as mammalian cells and, for 
this reason, insect cell expression systems have been developed to 
express efficiently mammalian proteins having native patterns of 
glycosylation, inter alia. Similar considerations apply to jther 
modifications . 

It will be appreciated that the same type of modification may 
be present in the same or varying degree at several sites in a 
given polypeptide., Also, a given polypeptide may contain many 
types of modifications. 

In general, as used herein, the term polypeptide encompasses 
all such modifications, particularly those that are present in 
polypeptides synthesized by expressing a polynucleotide in a host 
cell. 

VARIANT (S) of polynucleotides or polypeptides, as the term is 
used herein, are polynucleotides or polypeptides that differ from 
a reference polynucleotide or polypeptide, respectively. Variants 
in this sense are described below and elsewhere in the present 
disclosure in greater detail. 

(1) A polynucleotide that differs in nucleotide sequence from 
another, reference polynucleotide. Generally, differences are 
limited so that the nucleotide sequences of the reference and the 
variant are closely similar overall and, in many regions, 
identical. 

As noted below, changes in the nucleotide sequence of the 
variant may be silent. That is, they may not alter the amino acids 
encoded by the polynucleotide. Where alterations are limited to 
silent changes of this type a variant will encode a polypeptide 
with the same amino acid sequence as the reference. Also as noted 
below, changes in the nucleotide sequence of the variant may alter 
the amino acid sequence of a polypeptide encoded by the reference 
polynucleotide. Such nucleotide changes may result in amino acid 
substitutions, additions, deletions, fusions and truncations in the 
polypeptide encoded by the reference sequence, as discussed below. 

(2) A polypeptide that differs in amino acid sequence from 
another, reference polypeptide. Generally, differences are limited 
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So that the sequences of the reference and the variant are closed 
similar overall and. in many region, identical. 

A variant and reference polypeptide may differ in amino acid 
sequence by one or more substitutions, additions, 
lumens and truncations, which -y be present in any combination. 

RECEPTOR HOLECUI*. as used herein, refers to molecules which 

hind ofLTeract specifically with hBSP I, It and III polypeptides 

of the preset invention, including not only classic receptors. 

LcTare preferred, but also other molecules that specifically 

b^ to or!nteract with polypeptides of the invention -which also 

he referred to as -binding molecules- and "interaction 
may be referred to ^ binding 

^r^and^ 1 ; II £«" inaction -iecul...- Binding 
^ "olyp pXs - the invention and such Macules, including 
receptor or binding or interaction molecules may be exclusive to 

^otide. of the invention, which is very highiy preferred, or 
TZ T^V »P«ifi= for polypoid., of ^invention, which 
is MghTy preferred, or it may be highly specific to a group o 
pro«!ns tLt includes polypeptides of the invention, which is 
or it may be specific to sever.! group, of proteins at 
Hast one of which includes polypeptides of the invention. 

Receptor, also »ay be non-n.tur.Uy occurring such as 

antibodies and antibody-derived reagents that bind to polypeptides 

of the invention. 

DESCRIPTION OF THE INVBWTION 
The present invention relates to novel hBSF I. II ^ and III 
^iL - polvnudeoti.es, other ^^^^ 

in greater detail below. In particular, the inventio 
peptides and polynucleotides of novel ^ ^ ^ ' « - 
which are related by amino acid sequence homology to 
Static steroid-binding protein. The invention relates 
P er=iallV co hES, , II and „: having ^"^^ ~ 

is herein referred to as "the deposited clone or nucleoCi(ie 
the deposited clone." It will be appreciated that the 
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and amino acid sequences sec out in Figures l, 2 and 3 (SEQ ID 
NO: 2, 4 and 6) were obtained by sequencing the cDNA of the 
deposited clone. Hence, the sequence of the deposited clone is 
controlling as to any discrepancies between the two and any 
reference to the sequences of Figures 1, 2 and 3 (SEQ ID NO: J , 3 
, and 5) include reference to the sequence of the human cDNA of the 
deposited claim. 

Polynucleotides 

In accordance with one aspect of the present invention, there 
are provided isolated polynucleotides which encode hESP I, II and 
ill polypeptides having the deduced amino acid sequences of Figures 
1, 2 and 3 (SEQ ID N0:2, 4 and 6) . 

Using the information provided herein, such as the 
polynucleotide sequences set out in Figures 1, 2 and 3 (SBQ ID 
NO:l, 3 and 5) a polynucleotide of the present invention encoding 
human hESF I, II and III polypeptided may be obtained using 
standard cloning and screening procedures, such as those for 
cloning cDNAs using mRNA from cells of a human endometrial tumor 
as starting material. Illustrative of the invention, the 
polynucleotide set out in Figure 1 (SBQ ID NO : 1 ) was discovered in 
a cDNA library derived from cells of a human endometrial tumor. 
The polynucleotide of Figure 2 (SEQ ID NO: 3) was discovered in a 
cDNA library derived from cyclohexamide treated CEM cells. The 
polynucleotide of Figure 3 (SBQ ID NO: 5) was discovered in a cDNA 
library derived from human endometrial tumor. 

Human hESF I of the invention is structurally related to other 
proteins of the Clara cell secretory protein family, as shown by 
the results of sequencing the cDNA encoding human hESF I in the 
deposited clone. The cDNA sequence thus obtained is set out in 
Figure 1 (SEQ ID N0:1) • It contains an open reading frame encoding 
a protein of about 90 amino acid residues, wherein the initial 21 
amino acid residues represent a putative leader sequence, with a 
deduced molecular weight of the full-length protein of about 9.8 
kDa. The protein exhibits greatest homology to the rat prostatic 
steroid -binding protein, among known proteins. The protein hESF 
I has about 46.067% identity and about 66.3% similarity with the 
amino acid sequence of the rat prostatic steroid-binding protein. 
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Human hBSF II contains an open reading frame encoding a 
procein of about 90 amino acid residues, wherein the initial 21 
amino acid residues represent a putative leader sequence, with a 
deduced molecular weight of the full-length protein of about 9.9 
kDa The protein exhibits greatest homology to the rat prostatic 
steroid-binding protein, among known proteins. The protein hBSF 

II has about 49.438% identity and about 71.910% similarity with the 
amino acid sequence of rat prostatic steroid- binding protein C2. 

Human hBSF III contains an open reading frame encoding a 
protein of about 95 amino acid residues, wherein the initial 21 
amino acid residues represent a putative leader sequence, wxth a 
deduced molecular weight of the full-length protein of about 8.10 
jcoa The protein exhibits greatest homology to rat prostatic 
steroid-binding protein C3 , among known proteins. The protein hBSF 

III has about 36.2% identity and about 64.9% similarity with the 
amino acid sequence of the rat prostatic steroid-binding protein 

Polynucleotides of the present invention may be in the form 
of RNA such as mRNA, or in the form of DNA. including, for 
instance. cDNA and genomic DNA obtained by cloning or ~ ed ^ 
chemical synthetic techniques or by a combination ^ ^ 

^ y oe double-stranded or single-stranded. Single-stranded DHA may 
be the coding strand/also known as the sense strand, or it may be 
the non-coding strand, also referred to as the anti-sense strand. 

The coding sequence which encodes the polypeptides may be 
identical to the coding sequence of the polynucleotides shown in 
Figures l. 2 and 3 (SBQ ID NO:l, 3 and 5) • It ^ 
po!ynucleotide with a different sequence, which, as a result of the 

» nonet ic code, encodes the 

redundancy (degeneracy) of the genetic co . 

polypeptides of the human cDNA of Figures l, 2 and 3 (SBQ ID NO.i, 

3 ^Polynucleotides of the present invention which encode the 
polypeptides of Figures 1. 2 and 3 (SBQ ID NO:l, 3 and 5) may 
include, but are not limited to the coding sequence for the mature 
polypeptide, by itself; the coding sequence for the mature 
loZZ^e and additional coding sequences, such as those encoding 
a leader or secretory sequence, such as a pre-, or pro- or prepro- 
protein sequence,- the coding sequence of the mature polypeptide. 
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with or without the aforementioned auditional coding sequences, 
together with additional, non-coding sequences, including for 
example, but net limited to ir-rons and non-coding 5' and 3' 
sequences, such as the transcribed, non- translated sequences that 
play a role in transcription, mRNA processing - including splicing 
and polyadenylation signals, for example - ribosome binding and 
stability of mRNA; additional coding sequence which codes for 
additional amino acids, such as those which provide additional 
functionalities. Thus, for instance, the polypeptide may be fused 
to a marker sequence, such as a peptide, which facilitates 
purification of the fused polypeptide. In certain preferred 
embodiments of this aspect of the invention, the marker sequence 
is a hexa-histidine peptide, such as the tag provided in the pQB 
vector (Qiagen, Inc., among others, many of which are commercially 
available. As described in Gentz et al,, Proc. Natl. Acad. Sci., 
USA 86: 821-824 (1989), for instance, hexa-histidine provides for 
convenient purification of the fusion protein. The HA tag 
corresponds to an epitope derived of influenza hemagglutinin 
protein, which has been described by Wilson et al., Cell 37: 767 
(1984), for instance. 

In accordance with the foregoing, the term "polynucleotide 
encoding a polypeptide" as used herein encompasses polynucleotides 
which include a sequence encoding a polypeptide of the present 
invention, particularly human hBSP I, II and III having the amino 
acid sequences set out in Figures 1, 2 and 3 (SBQ ID NO: 2, 4 and 
6) . The term encompasses polynucleotides that include a single 
continuous region or discontinuous regions encoding the polypeptide 
(for exan^le, interrupted by introns) together with additional 
regions, that also may contain coding and/or non-coding sequences. 

The present invention further relates to variants of the 
herein above described polynucleotides which encode for fragments, 
analogs and derivatives of the polypeptides having the deduced 
amino acid sequences of Figures 1, 2 and 3 (SEQ ID NO: 2, 4 and 6) . 
A variant of the polynucleotide may be a naturally occurring 
variant such as a naturally occurring allelic variant, or it may 
be a variant that is not known to occur naturally. Such non- 
naturally occurring variants of the polynucleotide may be made by 
mutagenesis techniques, including those applied to polynucleotides, 
cells or organisms. 
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... Mong variants in this regard are variants that differ fro. 
aTelentioned poiynucleoti.es hy nucleotide : ^ 

, =,Mitions The substitutions, deletions or additions 
deletions or ^ vaIimt! ™y be altered 

rcay involve one or more nuc ^rations in the coding 

in coding or non-coding region, or both. ^ ^ 

regions «>y produce conservative or non-conservati 

substitutions. ^^• tmaasmm of the invention 

^ t he ^^^^ polypeptides having the 
in this regard »"'° l £ u ^ m sec out in Figures 1. 2 

^LT^. -d fragments of the variants, analogs and 

derivatives. _ regard are 

Further particularly preferred in this reg 
Further p iix variants . analogs, 

polynucleotides encoding hBSF I. II derivatives 
d erivatives and fragment JT^Ue. - « — 

amino acid residues are buu silent 

acneciallv preferred among tnese 
combination. Bspeciany p the 

« partitions and deletions, which do not 
substitutions, additions ana 

j .ftivities of the hBSF i, 11 
properties and activities rft „ aer vative substitutions. 

especially preferred ^ptidee 

^^jrsr^. - ngur.. x. . - . « » 

4 and 6). without substitutions. inve ntion are 

Further preferred embodiments of the mv 
Further p ?o% identical to a 

polynucleotides that are polyP eptides having 

polynucleotides encoding the hBSF I. an ^ ID 

polynucleotides. Alternatively, so eo% 

ptiynudeotides that cerise a region that is at 

^entical to a polypeptide encoding the ^ tidM 

^ pdna of the deposited clone ana poxy«« 
polypeptides of the cDNA of * polynucleotides at least 

complementary thereto. In this ly p re ferred. and among 

90% identical to the same are P a « icular ^ P ^ at least 

these particularly preferred polynucleotides, those 
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95% are especially preferred. Furthermore, chose with at least 97% 
are highly pref Hired among those with at least 95%, and among these 
those with at least 98% and at least 99% are particularly highly 
preferred, with at least 99% being the more preferred. 

Particularly preferred embodiments in this respect, moreover, 
are polynucleotides which encode polypeptides which retain 
substantially the same biological function or activity as the 
mature polypeptide encoded by the human cDNA of Figures 1, 2 and 
3 (SBQ ID NO: 1, 3 and 5) . 

The present invention further relates to polynucleotides that 
hybridize to the herein above -described sequences. In this regard, 
the present invention especially relates to polynucleotides which 
hybridize under stringent conditions to the herein above-described 
polynucleotides. As herein used, the term "stringent conditions" 
means hybridization will occur only if there is at least 95% and 
preferably at least 97% identity between the sequences. 



As discussed additionally herein regarding polynucleotide 
assays of the invention, for instance, polynucleotides of the 
invention as discussed above, may be used as a hybridization probe 
for cDNA and genomic DNA to isolate full-length cDNAs and genomic 
clones encoding hBSF I, II or III and to isolate cDNA and genomic 
clones of other genes that have a high sequence similarity to the 
human hESF I, II or III genes. Such probes generally will comprise 
at least 15 bases. Preferably, such probes will have at least 30 
bases and may have at least 50 bases. 

For example, the coding region of the hESF I, II and III genes 
may be isolated by screening using the known DNA sequence to 
synthesize an oligonucleotide probe. A labeled oligonucleotide 
having a sequence complementary to that of a gene of the present 
invention is then used to screen a library of human cDNA, genomic 
DNA or mRNA to determine which members of the library the probe 
hybridizes to. 

The polynucleotides and polypeptides of the present invention 
may be employed as research reagents and materials for discovery 
of treatments and diagnostics to human disease, as further 
discussed herein relating to polynucleotide assays, inter alia. 

The polynucleotides may encode a polypeptide which is the 
mature protein plus additional amino or carboxyl- terminal amino 
acids, or amino acids interior to the mature polypeptide (when the 
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mature form has more than one polypeptxde chain, for ^ 
Such sequences may P U, a role in processing of a protein fro. 
precursor to a mature ton.. may facilitate protein trarficicing. may 
prolong or shorten protein half -life or nay facilitate manipulation 
of a protein for assay or production, among other things. « 
generally is the case in situ, the additional amino acids may be 
processed away fro- the mature protein by cellular enzymes. 

A precursor protein, having the mature form of the polypeptide 
tu s.d to one or more proseouences may be an inactive fat. .of the 
polypeptide. when proseouences .re removed such inactive 
precursors generally are activated. Some or all of the 
proseouences may be removed before activation. Generally, such 
precursors are called proproteins. 

• in sum. a polynucleotide of the present invention may encode 
a mature protein, a mature protein plus a leader sequence Which 
ma, be referred to as a preproteinl . a precursor of a mstur. 
protein having one or more presentence, which are not the leader 
sequence, of a preprptein. or a preproprotein. which is a precursor 
to a ■ proprotein . having a leader sequence and 
pros.qu.nce., which generally are removed during processing steps 
that produce active and mature forms of the polypeptide 

Deposited material. ^ ^ 

A deposit containing a human nESr I, 11 a" 

deposited with the American Type Culture Collection, a. noted 
ahove. Also as noted above, the cDSA deposit is referred to herein 
a. -the deposited clone- or a. -the cDHA of the deposited clone 

1 deposited clone ... deposited with the Am.ric„ Type 
Culture CoUection. 12301 Par* Lawn Drive. RoOcille. ■toryland 
SlSTo*. o, January * . 1.M and ...igned ATCC Deposit No. 97.0!. 

(Stratagene. La Jolla. CA) containing the full lengtn n* 

^eTposit has been made under the term, of the Budapest 
Tre aty on the international recognition of the deposit of micro- 
organisms for purposes of patent procedure. The strai will be 
irrevocably and without restriction or condition released to the 
PuIIic upon the issuance of a patent. The deposit is provided 
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merely as convenience to those of skill in the art and is not an 
admission that a deposit is required for enablement, such as that 
required under 3 5 U.S.C. §x!2. 

The sequence of the polynucleotides contained in the deposited 
material, as well as the amino acid sequence of the polypeptide 
encoded thereby, are controlling in the event of any conflict with 
any description of sequences herein. 

A license may be required to make, use or sell the deposited 
materials, and no such license is hereby granted. 

Polypeptides 

The present invention further relates to a human hBSP I, ri 
and III polypeptide which has the deduced amino acid sequence of 
Figures 1, 2 and 3 (SBQ ID N0:2, 4 and 6) . 

The invention also relates to fragments, analogs and 
derivatives of these polypeptides. The terms "fragment," 
"derivative" and "analog" when referring to the polypeptide of 
Figures 1, 2 and 3 (SBQ ID NO: 2, 4 and 6) means a polypeptide which 
retains essentially the same biological function or activity as 
such polypeptide. Thus, an analog includes a proprotein which can 
be activated by cleavage of the proprotein portion to produce an 
active mature polypeptide. 

The polypeptide of the present invention may be a recombinant 
polypeptide, a natural polypeptide or a synthetic polypeptide. In 
certain preferred embodiments it is a recombinant polypeptide. 

The fragment, derivative or analog of the polypeptide of 
Figures 1, 2 and 3 (SBQ ID N0:2, 4 and 6) may be (i) one in which 
one or more of the amino acid residues are substituted with a 
conserved or non- conserved amino acid residue (preferably a 
conserved amino acid residue) and such substituted amino acid 
residue may or may not be one encoded by the genetic code, or (ii) 
one in which one or more of the amino acid residues includes a 
substituent group, or (iii) one in which the mature polypeptide is 
fused with another compound, such as a compound to increase the 
half -life of the polypeptide (for example, polyethylene glycol) , 
or (iv) one in which the additional amino acids are fused to the 
mature polypeptide , such as a leader or secretory sequence or a 
sequence which is employed for purification of the mature 
polypeptide or a proprotein sequence. Such fragments, derivatives 
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and analogs are deemed to be within the scope of those skilled in 
the art from the teachings herein. 

Among preferred variants are t:...,e that vary from a reference 
by conservative amino acid substitutions. Such substitutions are 
those that substitute a given amino acid in a polypeptide by 
another amino acid of like characteristics. Typically seen as 
conservative substitutions are the replacements, one for another, 
among the aliphatic amino acids Ala. Val, Leu and He,- interchange 
of the hydroxyl residues Ser and Thr. exchange of the acidic 
residues Asp and Glu. substitution between the amide residues Asn 
and Gin, exchange of the basic residues Lys and Arg and 
replacements among the aromatic residues Phe, Tyr. 

The polypeptides and polynucleotides of the present invention 
are preferably provided in an isolated form, and preferably are 

purified to homogeneity. 

The polypeptides of the present invention include the 
polypeptide of SBQ ID K0:2 (in particular the mature polypeptide, 
Is Jl as polypeptides which have at least 75* similarly 
(preferably at least 75% identity) to the polypeptide of SBQ ID 
N0:2 and more preferably at least 90% similarity (more Preferably 
at least 90% identity) to the polypeptide of SBQ ID N0:2 and stxl 
more preferably at least 95% similarity (still more preferably at 
least 95% identity) to the polypeptide of SBQ ID NO:2 and also 
include portions of such polypeptides with such portion ^of ^the 
polypeptide generally containing at least 30 amino acids and more 
preferably at least 50 amino acids. 

A. kLm in the art "similarity- between wo polypeptides x. 
decennial by comparing the amino acid .equence and it. conserved 
amino acid substitute, of on. polypeptide to the .equence of a 

second polypeptide. „«, fient 
Fragment, or portions of the polypeptide, of the present 

invention may be employed for producing the ^'^J* j""" 
length polypeptide by peptide synthesis.- therefore the fragment. 
M y be employed a. intermediate, for producing the 
poIypeptiTc. Fragment, or portion, of the polynucleotide, of the 
Prefer invention may be used to synthesize full-length 
polynucleotides of the present invention. 

Fragments 
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Also among preferred embodiments of this aspect of the present 
invention are polypeptides comprising fragments of hESF I, II and 
III, most particularly fragments of the hBSF I, II and III having 
the amino acid sequence set out in Figures 1, 2 and 3 (SEQ ID N0:2, 
4 and 6), and fragments of variants and derivatives of the hBSF I, 
II and III of Figures 1, 2 and 3 (SBQ ID NO: 2, 4 and 6) . 

In this regard a fragment is a polypeptide having an amino 
acid sequence that entirely is the same as part but not all of the 
amino acid sequence of the aforementioned hESF I, II and III 
polypeptides and variants or derivatives thereof. 

Such fragments may be "free-standing, n i.e., not part of or 
fused to other amino acids or polypeptides, or they may be 
comprised within a larger polypeptide of which they form a part or 
region. When comprised within a larger polypeptide, the presently 
discussed fragments most preferably form a single continuous 
region. However, several fragments may be comprised within a 
single larger polypeptide. For instance, certain preferred 
embodiments relate to a fragment of a hESF I, II or III polypeptide 
of the present comprised within a precursor polypeptide designed 
for expression in a host and having heterologous pre and pro- 
polypeptide regions fused to the amino terminus of the hESF I, II 
and III fragment and an additional region fused to the carboxyl 
terminus of the fragment. Therefore, fragments in one aspect of 
the meaning intended herein, refers to the portion or portions of 
a fusion polypeptide or fusion protein derived from hESF I, II and 
III. 

As representative examples of polypeptide fragments of the 
invention, there may be mentioned those which have from about 15 
to about 139 amino acids. 

In this context about includes the particularly recited range 
and ranges larger or smaller by several, a few, 5, 4, 3, 2 or 1 
amino acid at either extreme or at both extremes. Highly 
preferred in this regard are the recited ranges plus or minus as 
many as 5 amino acids at either or at both extremes. Particularly 
highly preferred are the recited ranges plus or minus as many as 
3 amino acids at either or at both the recited extremes. 
Especially preferred are ranges plus or minus 1 amino acid at 
either or at both extremes or the recited ranges with no additions 
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or deletions. Most highly preferred of all in this regard are 
fragments from auout 7.5 to about 45 amino acids. 

Among especially preferred fragments of the invention are 
truncation mutants of hESF I. II and III. Truncation mutants 
include hESF I. II and III polypeptides having the amino acxd 
sequence of Figures 2 and 3 (SEQ ID NO: 2 . 4 and 6) , or variants 
or derivatives thereof, except for deletion of a continuous series 
of residues (that is, a continuous region, part or portion) that 
includes the amino terminus, or a continuous series of residues 
that includes the carboxyl terminus or, as in double truncation 
mutants, deletion of two continuous series of residues, one 
including the amino terminus and one including the carboxyl 
terminus. Fragments having the size ranges set out about also are 
preferred embodiments of truncation fragments, which are especially 
preferred among fragments generally. 

Also preferred in this aspect of the invention are fragments 
characterized by structural or functional attributes of hBSF I. IX 
and III Preferred embodiments of the invention in this regard 
include fragments that comprise alpha-helix and alpha-helix forming 
regions (« alpha - r egions-) . beta-sheet and beta -sheet -forming 
regions (-beta-regions-), turn and turn-forming regions turn- 
regions",, coil and coil-forming regions 

hydrophilic regions, hydrophobic regions, alpha amphipathic 
regions, beta amphipathic regions, flexible regions, Re- 
forming regions and high antigenic index regions of hESF and 

' certain preferred regions in these regards are set out in 
Figures 4, 5 and 6 and include, but are not limited to, regions of 
the aforementioned types identified by analysis of the amino acid 
sequence set out in Figures i. 2 and 3 (SBQ ID N0:2, 4 and 6) As 
seT out in Figures 4. S and 6 such preferred regions include 
Gamier-Robson alpha -regions, beta-regions, turn- regions and coil - 
regions. Chou-Fasman alpha- regions, beta-regions and turn-regions 
Kyte-Doolittle hydrophilic regions and hydrophilic regions, 
slsenberg alpha and beta amphipathic regions. Karplus-Schulz 
flexible regions. Emini surf ace -forming regions and aameson-Wolf 
high antigenic index regions. 

Aaong highly preferred fragments in chis regard are a™*™ 
comprise regions of hESF I. II and III that contom. several 
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structural features, such as several of the features set out above. 
In this regard, the regions defined by the residues of Figures l, 
2 and 3 [SEQ ID N0:2 ( 4 and 6)/ which all are characterized by 
amino acid cpnpositions highly characteristic of turn-regions, 
hydrophilic regions, flexible-regions, surf ace -forming regions , and 
high antigenic index - regions , are especially highly preferred 
regions. Such regions may be comprised within a larger polypeptide 
or may be by themselves a preferred fragment of the present 
invention, as discussed above. It will be appreciated that the 
term "about" as used in this paragraph has the meaning set out 
above regarding fragments in general. 

Further preferred regions are those that mediate activities 
of hBSF I, II and in. Most highly preferred in this regard are 
fragments that have a chemical, biological or other activity of 
hBSF I, II and III, including those with a similar activity or an 
itqproved activity, or with a decreased undesirable activity. 
Highly preferred in this regard are fragments that contain regions 
that are homolbgs in sequence, or in position, or in both sequence 
and to active regions of related polypeptides, such as the related 
polypeptides set out in Figures 4, 5 and 6 (SBQ ID NO: 2, 4 and 6) 
and which include rat prostatic specif ic- binding proteins. Among 
particularly preferred fragments in these regards are truncation 
mutants, as discussed above. 

It will be appreciated that the invention also relates to, 
among others, polynucleotides encoding the aforementioned 
fragments, polynucleotides that hybridize to polynucleotides 
encoding the fragments, particularly those that hybridize under 
stringent conditions, and polynucleotides, such as PCR primers, for 
amplifying polynucleotides that encode the fragments. In these 
regards, preferred polynucleotides are those that correspondent to 
the preferred fragments, as discussed above. 

Vectors, host cell6, expression 

The present invention also relates to vectors which include 
polynucleotides of the present invention, host cells which are 
genetically engineered with vectors of rhe invention and the 
production of polypeptides of the invention by recombinant 
techniques. 
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.-■ Best cells can be genetically engineered to incorporate 
polynucleotides and express polypeptides of tne present 
Por instance, polynucleotides -y be introduced into host cells 
using -ell known technique, of infection. transduction, 
transection, transvection and transformation. Tne polynucleotides 
ray be introduced alone or with other polynucleotides. Such other 
polynucleotides may be introduced independently, co-introduced or 
introduced Joined to the polynucleotides of the invention. 

Thus, for instance, polynucleotides of the invention may be 
transfected Into host cells with another, separate polynucleotide 
encoding a .eatable marker, using standard technique, for co- 
transfectioa and selection in. for instance, na—Uan cells. In 
this case the polynucleotides generally will be stably incorporated 

into the host cell genome. 

Alternatively, the polynucleotides nay be Joined to a vector 
containing a selectable marker, for propagation in a ho.t^ The 
vector construct may be introduced into host th » 
aforementioned techniques. General*, a plasmid vector i. 
introduced as DNA in a precipitate, such as a calcium 
Precipitate, or in a convex with a charorf lipid, "^"-f » 
Ilso may be used to introduce polynucleotide, into a host. If eb. 
vector i. a virus, it may be packed in vitro or introduced into 
r Jacfcagins cell and Che packaged virus may be transduced into 
cens A wide variety of technique, suitable for making 
ptlynucleotide. and for introducing polynucleotides into ^ell.J* 
Lcordanc. with this aspect of the invweion are well known and 
routine to thos. of still in the art. Such technique, are reviewed 
iHength in Sambrook et .1. cited above, which is illu.tr.tive of 
the many laboratory manual, that detail these technique.. I n 
accordance with thi. aspect of the invention the vector may be for 
example, a plasmid vector, a single or doubl.-.tranded Phage 
: ror. a .l4l. or double-.tranded «. or "~ <^ J£ 
vectors may be introduced into cells a. polynucleotide.. *^**> 
DHA. by -11 known technique, for introducing » ^° 
cell.. The vectors, in the case of phage and vxra! ™ s * ls ° 
Ly be and preferably are introduced into cells a. packaged or 
Tcapsidated virus by well known techniques for infection and 
t3u«ion. Viral vectors may be replication competent or 
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replication defective. In the latter case vi^al propagation 
generally will occur only in complementing host cells. 

Pr^Jerred among vectors, in certain respects, are those for 
expression of polynucleotides and polypeptides of the present 
invention. Generally, such vectors comprise cis -acting control 
regions effective for expression in a host operatively linked to 
the polynucleotide to be expressed. Appropriate trans -acting 
factors either are supplied by the host, supplied by a 
complementing vector or supplied by the vector itself upon 
introduction into the host. 

In certain preferred embodiments in this regard, the vectors 
provide for specific expression. Such specific expression may be 
inducible expression or expression only in certain types of cells 
or both inducible and cell -specific. Particularly preferred among 
inducible vectors are vectors that can be induced for expression 
by environmental factors that are easy to manipulate, such as 
temperature and nutrient additives. A variety of vectors suitable 
to this aspect of the invention, including constitutive and 
inducible expression vectors for use in prokaryotic and eukaryotic 
hosts, are well known and employed routinely by those of skill in 
the art. 

The engineered host cells can be cultured in conventional 
nutrient media, which may be modified as appropriate for, inter 
alia, activating promoters, selecting transf ormants or amplifying 
genes. Culture conditions, such as temperature, pH and the like, 
previously used with the host cell selected for expression 
generally will be suitable for expression of polypeptides of the 
present invention as will be apparent to those of skill in the art. 

A great variety of expression vectors can be used to express 
a polypeptide of the invention. Such vectors include chromosomal, 
episomal and virus-derived vectors e.g., vectors derived from 
bacterial plasmids, from bacteriophage, from yeast episomes, from 
yeast chromosomal elements, from viruses such as baculoviruses , 
papova viruses, such as SV40, vaccinia viruses, adenoviruses, fowl 
pox viruses, pseudorabies viruses and retroviruses,, and vectors 
derived from combinations thereof, such as those derived from 
plasmid and bacteriophage genetic elements, such as cosmids and 
phagemids, all may be used for expression in accordance with this 
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aspect of the present invention. Generally, any vector suitable 
to maintain, propagate or express polynucleotides to express a 
polypeptide in a host may be used for expression in this regard. 

The appropriate DNA sequence may be inserted into the vector 
by any of a variety of well-known and routine techniques. In 
general, a DNA sequence for expression is joined to an expression 
vector by cleaving the DNA sequence and the expression vector with 
one or more restriction endonucleases and then joining the 
restriction fragments together using T4 DNA ligase. Procedures for 
restriction and ligation that can be used to this end are well 
known and routine to those of skill. Suitable procedures in this 
regard , and for constructing expression vectors using alternative 
techniques, which also are well known and routine to those skill, 
are set forth in great detail in Sambrook et al. cited elsewhere 
herein . 

The DNA sequence in the expression vector is operatively 
linked to appropriate expression control sequence (6), including, 
for instance, a promoter to direct mRNA transcription. 
Representatives of such promoters include the phage lambda PL 
promoter, the E. coli lac, trp and tac promoters, the SV40 early 
and late promoters and promoters of retroviral LTRfi, to name just 
a few of the well-known promoters. It will be understood that 
numerous promoters not mentioned are suitable for use in this 
aspect of the invention are well known and readily may be employed 
by those of skill in the manner illustrated by the discussion and 
the examples herein. 

In general, expression constructs will contain sites for 
transcription initiation and termination, and, in the transcribed 
region, a ribosome binding site for translation. The coding 
portion of the mature transcripts expressed by the constructs will 
include a translation initiating AUG at the beginning and a 
termination codon appropriately posit »ned at the end of the 
polypeptide to be translated. 

In addition, the constructs may contain control regions that 
regulate as well as engender expression. Generally, in accordance 
with many conmonly practiced procedures, such regions will operate 
by controlling transcription, such as repressor binding sites and 
enhancers, among others. 
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Vectors for propagation and expression generally will include 
selectable markers. Such markers also may be suitable Tor 
amplification or Liie vectors may contain additional markers for 
this purpose. In this regard, the expression vectors preferably 
contain one or more selectable marker genes to provide a phenotypic 
trait for selection of transformed host cells. Preferred marker:; 
include dihydrof olate reductase or neomycin resistance for 
eukaryotic cell culture, and tetracycline or ampicillin resistance 
genes for culturing E. coli and other bacteria. 

The vector containing the appropriate DNA sequence as 
described elsewhere herein, as well as an appropriate promoter, and 
other appropriate control sequences, may be introduced into an 
appropriate host using a variety of well known techniques suitable 
to expression therein of a desired polypeptide. Representative 
exanples of appropriate hosts include bacterial cells, such as B. 
coli # Streptomyces and Salmonella typhimurium cells; fungal cells, 
such as yeast cells; insect cells such as Drosophila S2 and 
Spodoptera Sf9 cells; animal cells such as CHO, COS and Bowes 
melanoma cells; and plant cells. Hosts for of a great variety of 
expression constructs are well known, and those of skill will be 
enabled by the present disclosure readily to select a host for 
expressing a polypeptides in accordance with this aspect of the 
present invention. 

More particularly, the present invention also includes 
recombinant constructs, such as expression constructs, comprising 
one or more of the sequences described above. The constructs 
comprise a vector, such as a plasmid or viral vector, into which 
such a sequence of the invention has been inserted. The sequence 
may be inserted in a forward or reverse orientation. In certain 
preferred embodiments in this regard, the construct further 
comprises regulatory sequences, including, for example, a promoter, 
operably linked to the sequence. Large numbers of suitable 
vectors and promoters are known to those of skill in the art, and 
there are many convnercially available vectors suitable for use in 
the present' invention. 

The following vectors, which are commercially available, are 
provided by way of example. Among vectors preferred for use in 
bacteria are pQE70, pQE60 and pQE-9, available from Qiagen; pBS 
vectors, Phagescript vectors, Bluescript vectors, pNHBA, pNH16a, 
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pNHl8A, pNH46A, available from Stratagene; and ptrc99a, pKK223-3. 
pKX233-3. pDR540, pRITS available from Pharmacia. Among preferred 
eukaryotic vectors are pWLNEO. pSV2CAT, pOG44, pXTl and pSG 
available from Stratagene; and pSVK3. pBPV, pMSG and pSVL available 
from Pharmacia. These vectors are listed solely by way of 
illustration of the many commercially available and well known 
vectors that are available to those of skill in the art for use in 
accordance with this aspect of the present invention. It will be 
appreciated that any other plasmid or vector suitable for, for 
example, introduction, maintenance, propagation or expression of 
a polynucleotide or polypeptide of the invention in a host may be 
used in this aspect of the invention. 

Promoter regions can be selected from any desired gene using 
vectors that contain a reporter transcription unit lacking a 
promoter region, such as a chloramphenicol acetyl transferase 
("cat") transcription unit, downstream of restriction site or sites 
for introducing a candidate promoter fragment; i.e., a fragment 
that may contain a promoter. As is well known, introduction into 
the vector of a promoter-containing fragment at the restriction 
site upstream of the cat gene engenders production of CAT 
activity, which can be detected by standard CAT assays, vectors 
suitable to this end are well known and readily available. Two 
such vectors are pKX232-8 and pCM7. Thus, promoters for expression 
of polynucleotides of the present invention include not only well 
known and readily available promoters, but also promoters that 
readily may be obtained by the foregoing technique, using a 
reporter gene. 

Among known bacterial promoters suitable for expression of 
polynucleotides and polypeptides in accordance with the present 
invention are the B. coli lad and lacZ and promoters, the T3 and 
T7 promoters, the gpt promoter, the lambda PR, PL promoters and the 
trp promoter. Among known eukaryotic promoters suitable in 

this regard are the CMV immediate early promoter, the HSV thymidine 
kinase promoter, the early and late SV40 promoters, the promoters 
of retroviral LTRS. such as those of the Rous sarcoma virus 

("RSV"). and metallothionein promotes, such as the mouse 
metallothionein-I promoter. 

Selection of appropriate vectors and promoters for expression 

in a host cell is a well known procedure and the requisite 
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techniques for expression vector construction, introduction of the 
vector into the host and expression in the host are routine skills 
in the art . 

The present invention also relates to host cells containing 
the above -described constructs discussed above. The hose cell can 
be a higher eukaryotic cell, such as a mammalian cell, or a lower 
eukaryotic cell, such as a yeast cell, or the host cell can be a 
prokaryotic cell, such as a bacterial cell. 

Introduction of the construct into the host cell can be 
effected by calcium phosphate transf ection, DEAE-dextran mediated 
trans feet ion, cat ionic lipid -media ted transf ection, 
electroporation, transduction, infection or other methods. Such 
methods are described in many standard laboratory manuals, such as 
Davis et al. BASIC METHODS IN MOLECULAR BIOLOGY, (1986) . 

Constructs in host cells can be used in a conventional manner 
to produce the gene product encoded by the recombinant sequence. 
Alternatively, the polypeptides of the invention can be 
synthetically produced by conventional peptide synthesizers. 

Mature proteins can be expressed in mammalian cells, yeast, 
bacteria, or other cells under the control of appropriate 
promoters. Cell-free translation systems can also be employed to 
produce such proteins using RNAs derived from the DNA constructs 
of the present invention. Appropriate cloning and expression 
vectors for use with prokaryotic and eukaryotic hosts are described 
by Sambrook et al., MOLECULAR CLONING: A LABORATORY MANUAL, 2nd 
Bd. , Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. 
(1989) . 

Generally, recombinant expression vectors will include origins 
of replication, a promoter derived from a highly- expressed gene to 
direct transcription of a downstream structural sequence, and a 
selectable marker to permit isolation of vector containing cells 
after exposure to the vector. Among suitable promoters are those 
derived from the genes that encode glycolytic enzymes such as 3- 
phosphoglycerate kinase ( "PGK") , a-f actor, acid phosphatase, and 
heat shock proteins, among others. Selectable markers include the 
ampicillin resistance gene of E. coli and the trpl gene of S. 
cerevisiae. 

Transcription of the DNA encoding the polypeptides of the 
present invention by higher eukaryotes may be increased by 
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inserting an enhancer sequence into the vector. Enhancers are cis- 
acting elements of DNA. usually about from 10 to 300 bp that act 
to increase transcriptional activity of a promoter in a given host 
cell-type. Examples of enhancers include the SV40 enhancer, which 
is located on the late side of the replication origin at bp 100 to 
270, the cytomegalovirus early promoter enhancer, the polyoma 
enhancer on the late side of the replication origin, and adenovirus 
enhancers. 

Polynucleotides of the invention, encoding the heterologous 
structural sequence of a polypeptide of the invention generally 
will be inserted into the vector using standard techniques so that 
it is operably linked to the promoter for expression. The 
polynucleotide will be positioned so that the transcription start 
site is located appropriately 5' to a ribosome binding site. The 
ribosome binding site will be 5' to the AUG that initiates 
translation of the polypeptide to be expressed. Generally, there 
will be no other open reading frames that begin with an initiation 
codon, usually AUG, and lie between the ribosome binding site and 
the initiating AUG. Also, generally, there will be a translation 
stop codon at the end of the polypeptide and there will be a 
polyadenylation signal and a transcription termination signal 
appropriately disposed at the 3' end of the transcribed region. 

For secretion of the translated protein into the lumen of the 
endoplasmic reticulum, into the periplastic space or into the 
extracellular environment, appropriate secretion signals may be 
incorporated into the expressed polypeptide. The signals may be 
endogenous to the polypeptide or they may be heterologous signals. 

The polypeptide may be expressed in a modified form, such as 
a fusion protein, and may include not only secretion signals but 
also additional heterologous functional regions. Thus, for 
instance, a region of additional amino acids, particularly charged 
amino acids, may be added to the N- terminus of the polypeptide to 
improve stability and persistence in the host cell, during 
purification or during subsequent handling and storage. Also, 
region also may be added to the polypeptide to facilitate 
purification. Such regions may be removed prior to final 
preparation of the polypeptide. The addition of peptide moieties 
to polypeptides to engender secretion or excretion, to improve 
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stability and to facilitate purification, among others, are 
familiar and routine techniques in -he art. 

Suitable prokaryotic hosts :~ir propagation, maintenance or 
expression of, polynucleotides and polypeptides in accordance with 
the invention include Bscherischia coli, Bacillus subtilis and 
Salmonella typhimuriura. Various species of Pseudomonas, 

Streptomyces, and Staphylococcus are suitable hosts in this regard. 
Moreover, many other hosts also known to those of skill may be 
employed in this regard. 

As a representative but non-limiting example, useful 
expression vectors for bacterial use can comprise a selectable 
marker and bacterial origin of replication derived from 
commercially available plastnids comprising genetic elements of the 
well known cloning vector pBR322 (ATCC 37017) . Such cotmnercial 
vectors include, for example, pKK223-3 (Pharmacia Pine Chemicals, 
Uppsala, Sweden) and GEM1 (Promega Biotec, Madison, WI, USA). 
These pBR322 "backbone" sections are combined with an appropriate 
promoter and the structural sequence to be expressed. 

Following transformation of a suitable host strain and growth 
of the host strain to an appropriate cell density, where the 
selected promoter is inducible it is induced by appropriate means 
(e.g. , temperature shift or exposure to chemical inducer) and cells 
are cultured for an additional period. 

Cells typically then are harvested by centrifugation, 
disrupted by physical or chemical means, and the resulting crude 
extract retained for further purification. 

Microbial cells employed in expression of proteins can be 
disrupted by any convenient method, including freeze- thaw cycling, 
sonication, mechanical disruption, or use of cell lysing agents, 
such methods are well know to those skilled in the art. 

Various mammalian cell culture systems can be employed for 
expression, as well. Examples of raattmalian expression systems 
include the COS-7 lines of monkey kidney fibroblast, described in 
Gluzman et al., Cell 23: 175 (1981). Other cell lines capable of 
expressing a compatible vector include for example, the C127, 3T3, 
CHO, HeLa, human kidney 293 and BHK cell lines. 

Mammalian expression vectors will comprise an origin of 
replication, a suitable promoter and enhancer, and also any 
necessary ribosorae binding sites, polyadenylation sites, splice 
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donor and acceptor sites, transcriptional termination sequences, 
and S- flanking non- transcribed sequences that are necessar/ to* 
expression. In certain preferred embodiments in this regard DNA 
sequences derived from the SV40 splice sites, and the SV40 
polyadenylation sites are used for required non- transcribed genetic 

elements of these types. 

The hBSF I. II and III polypeptide can be recovered and 
purified from recombinant cell cultures by well-known methods 
including ammonium sulfate or ethanol precipitation. acid 
extraction. anion or cation exchange chromatography, 
phosphocellulose chromatography. hydrophobic interaction 
chromatography, affinity chromatography. ^f™^" 
chromatography and lectin chromatography. Most preferably hxgh 
performance liquid chromatography ("HPLC") is employed for 
purification. Well known techniques for refolding protein may be 
employed to regenerate active conformation when the polypeptide x. 
denatured during isolation and or purification. natlirallv 
Polypeptides of the present invention include naturally 
purified products, products of chemical synthetic procedures, and 
products produced by recombinant techniques from a prokaryotxc or 
eukaryotic host, including, for example, bacterial, yeast, higher 
plant, insect and mammalian cells. Depending upon the host 
employed in a recombinant production procedure, the polypeptxdes 
o7 the present invention may be glycosylated or may be non- 
glycosylated. In addition, polypeptides of the invention may also 
include an initial modified methionine residue, in some cases as 
a result of host -mediated processes. 

MS? I II and III polynucleotides and polypeptides may be 
used in accordance with the present invention for a variety of 
applications, particularly those that make use of the chemical and 
biological properties hBSP I, II and III. Additional applications 
relaH to diagnosis and to treatment of disorders of ™ 
and organisms. These aspects of the invention are illustrated 
further by- the following discussion. 



Polynucleotide assays 

This invention is also related to the use of the hBSF I. II 
and III polynucleotides to detect complementary polynucleotides 
such as, for example, as a diagnostic reagent. Detection of a 
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mutated form of hESF I, II and III associated with a dysfunction 
will provide a diagnostic tool that can add or define a diagnosis 
of a disease or susceptibility to a disease which results from 
under-expression, over-expression or altered expression of hESF I, 
II and III, such as, for example, a susceptibility to inherited 
asthma and endometrial cancer. 

Individuals carrying mutations in the human hESF I, II and III 
gene may be detected at the DNA level by a variety of techniques. 
Nucleic acids for diagnosis may be obtained from a patient's cells, 
such as from blood, urine, saliva, tissue biopsy and autopsy 
material. The genomic DNA may be used directly for detection or 
may be amplified enzymatically by using PGR prior to analysis. PGR 
(Saiki et al. ( Nature, 324: 163-166 (1986)). RNA or cDNA may also 
be used in the same ways. As an example, PCR primers complementary 
to the nucleic acid encoding hESF I, II and III can be used to 
identify and analyze hESF I, II and III expression and mutations. 
For example, deletions and insertions can be detected by a change 
in size of the amplified product in comparison to the normal 
genotype. Point mutations can be identified by hybridizing 
amplified DNA to radiolabeled hESF I, II and III RNA or 
alternatively, radiolabeled hBSP I, II and III antisense DNA 
sequences. Perfectly matched sequences can be distinguished from 
mismatched duplexes by RNase A digestion or by differences in 
melting temperatures. 

Sequence differences between a reference gene and genes having 
mutations also may be revealed by direct DNA sequencing. In 
addition, cloned DNA segments may be employed as probes to detect 
specific DNA segments. The sensitivity of such methods can be 
greatly enhanced by appropriate use of PCR or another amplification 
method. For example, a sequencing primer is used with double- 
stranded PCR product or a single-stranded template molecule 
generated by a modified PCR. The sequence determination is 
performed by conventional procedures with radiolabeled nucleotide 
or by automatic sequencing procedures with fluorescent -tags . 

Genetic testing based on DNA sequence differences may be 
achieved by detection of alteration in electrophoretic mobility of 
DNA fragments in gels, with or without denaturing agents. Small 
sequence deletions and insertions can be visualized by high 
resolution gel electrophoresis. DNA fragments of different 
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sequences may be distinguished on denaturing formamide gradient 
gels in which the mobilities of different DMA fragments are 
retarded in the gel at different positions according to their 
specific melting or .partial melting temperatures (see, e.g.. Myers 
et al., Science. 230: 1242 (1985)). 

Sequence changes at specific locations also may be revealed 
by nuclease protection. assays, such as RNase and Si protection or 
the chemical cleavage method (e.g.. Cotton et al., Proc. Natl. 
Acad. Sci.. USA. 85: 4397-4401 (1985)). 

Thus the detection of a specific DMA sequence may be achieved 
by methods such as hybridization. RNase protection, chemical 
cleavage, direct DNA sequencing or the use of restriction enzymes, 
(e.g.. restriction fragment length polymorphisms ( "RFLP ) and 
Southern blotting of genomic DNA. 

In addition to more conventional gel-electrophoresis and DNA 
sequencing, mutations also can be detected by in situ analysis. 

Chromosome assays 

The sequences of the present invention are also valuable for 
chromosome identification. The sequence is specifically targeted 
to and can hybridize with a particular location on an individual 
human chromosome. Moreover, there is a current need for 
identifying particular sites on the chromosome. Few chromosome 
narking reagents based on actual sequence data (-peat 
polymorphisms) are presently available for marking chromosomal 
location. The mapping of DNAs to chromosomes according to the 
present invention is an important first step in correlating those 
sequences with genes associated with disease. 

in certain preferred embodiments in this regard, the cDNA 
herein disclosed is used to clone genomic DNA of a hBSP I, II and 
III gene. This can be accomplished using a variety of well known 
techniques and libraries. which generally are available 
commercially. The genomic DNA the is used for in situ chromosome 
mapping using well known techniques for this purpose. Typically. 
• in accordance with routine procedures for chromosome mapping, some 
trial and error may be necessary to identify a genomic probe that 
gives a good in situ hybridization signal. 

in some cases, in addition, sequences can be mapped to 
chromosomes by preparing PGR primers (preferably 15-25 bp) from the 
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cDNA. Computer analysis of the 3' untranslated region of the gene 
is used to rapidly select primers that do not span more than one 
exon in the genomic DNA, thus complicating the amplification 
process. These primers are tiitsn used for PCR screening of somatic 
cell hybrids containing individual human chromosomes. Only those 
hybrids containing the human gene corresponding to the primer will 
yield an amplified fragment. 

PCR mapping of somatic cell hybrids is a rapid procedure for 
assigning a particular DNA to a particular chromosome. Using the 
present invention with the same oligonucleotide primers, 
sublocalization can be achieved with panels of fragments from 
specific chromosomes or pool6 of large genomic clones in an 
analogous manner. Other mapping strategies that can similarly be 
used to map to its chromosome include in situ hybridization, 
prescreening with labeled flow-sorted chromosomes and preselection 
by hybridization to construct chromosome specif ic-cDNA libraries. 

Fluorescence in situ hybridization ("PISH") of a cONA clone 
to a me taphase chromosomal spread can be used to provide a precise 
chromosomal location in one step. This technique can be used with 
cDNA as short as 50 or 60. For a review of this technique, see 
Verma et al. , HUMAN CHROMOSOMES : A MANUAL OF BASIC TECHNIQUES , 
Pergamon Press, New York (1988). 

Once a sequence has been mapped to a precise chromosomal 
location, the physical position of the sequence on the chromosome 
can be correlated with genetic map data. Such data are found, for 
example, in V. McKusick, MKNDBLIAN INHERITANCE IN MAN, available 
on line through Johns Hopkins University, Welch Medical Library. 
The relationship between genes and diseases that have been mapped 
to the same chromosomal region are then identified through linkage 
analysis (coinheritance of physically adjacent genes) . 

Next, it is necessary to determine the differences in the cDNA 
or genomic sequence between affected and unaffected individuals. 
If a mutation is observed in some or all of the affected 
individuals but not in any normal individuals, then the mutation 
is likely to be the causative agent of the disease. 

With current resolution cf physical mapping and genetic 
mapping techniques, a cDNA precisely localized to a chromosomal 
region associated with the disease could be one of between 50 and 
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500 potential causative genes. (Thas assumes 1 megabase mapping 
resolution and one gene per 20 kb) . 

Polypeptide assays 

The present invention also relates to a diagnostic assays such 
as quantitative and diagnostic assays for detecting levels of hBSF 
I, II and III protein in cells and tissues, and biological fluids 
such, for example, as blood and urine, including determination of 
normal and abnormal levels. Thus, for instance, a diagnostic assay 
in accordance with the invention for detecting over -expression or 
under- expression of hBSF I, II and III protein compared to normal 
control tissue samples may be used to detect the presence of 
neoplasia, for example. Assay techniques that can be used to 
determine levels of a protein, such as an hBSF I. II and III 
protein of the present invention, in a sample derived from a host 
are well-known to those of skill in the art. Such assay methods 
include radioimmunoassays, competitive -binding assays. Western Blot 
analysis and BLISA assays. Among these ELISAb frequently are 
preferred. An BLISA assay initially comprises preparing an 
antibody specific to hESF I. II or III. preferably a monoclonal 
antibody. In addition a reporter antibody generally is prepared 
which binds to the monoclonal antibody. The reporter antibody is 
attached a detectable reagent such as radioactive, fluorescent or 
enzymatic reagent, for example horseradish peroxidase enzyme. 

To carry out an BLISA a sample is removed from a host and 
incubated on a solid support, e.g. a polystyrene dish, that binds 
the proteins in the sample. Any free protein binding sites on the 
dish are then covered by incubating with a non-specific protein 
such as bovine serum albumin. Next, the monoclonal antibody is 
incubated in the dish during which time the monoclonal antibodies 
attach to any hBSP I. II or III proteins attached to the 
polystyrene dish. Unbound monoclonal antibody is washed out with 
buffer. The reporter antibody linked to horseradish peroxidase is 
placed in the dish resulting in binding of the reporter antibody 
to any monoclonal antibody bound to hBSF I. II or III. Unattached 
reporter antibody is then washed out. Reagents for peroxidase 
activity, including a colorimetric substrate are then added to the 
dish, immobilized peroxidase, linked to hBSF I. II or III through 
the primary and secondary antibodies, produces a colored reaction 
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product. The amount of color developed in a given time period 
indicates the amount of hESF I, II or III protein present in the 
sample. Quantitative results typically are obtained by reference 
to a standard curve. 

A competition assay may be employed wherein antibodies 
specific to hESF I , II or III attached to a solid support and 
labeled hESF I, II or III and a sample derived from the host are 
passed over the solid support and the amount of label detected 
attached to the solid support can be correlated to a quantity of 
hESF I, II or III in the sample. 

Antibodies 

The polypeptides, their fragments or other derivatives, or 
analogs thereof, or cells expressing them can be used as an 
immunogen to produce antibodies thereto. These antibodies can be, 
for example, polyclonal or monoclonal antibodies. The present 
invention also includes chimeric, single chain, and humanized 
antibodies, as well as Fab fragments, or the product of an Fab 
expression library. Various procedures known in the art may be 
used for the production of such antibodies and fragments. 

Antibodies generated against the polypeptides corresponding 
to a sequence of the present invention can be obtained by direct 
injection cf the polypeptides into an animal or by administering 
the polypeptides to an animal, preferably a nonhuman. The antibody 
so obtained will then bind the polypeptides itself. In this 
manner, even a sequence encoding only a fragment of the 
polypeptides can be U6ed to generate antibodies binding the whole 
native polypeptides. Such antibodies can then be used to isolate 
the polypeptide from tissue expressing that polypeptide. 

For preparation of monoclonal antibodies, any technique which 
provides antibodies produced by continuous cell line cultures can 
be used. Examples include the hybridoraa technique (Kohler, G. and 
Milstein, c, Nature 256: 495-497 (1975), the trioma technique, the 
human B-cell hybridoma technique (Kozbor et al., Immunology Today 
4: 72 (1983) and the BBV-hybridoma technique to produce human 
monoclonal antibodies (Cole et al., pg. 77-96 in MONOCLONAL 
ANTIBODIES AND CANCER THERAPY, Alan R. Liss, Inc. (1985) . 

Techniques described for the production of single chain 
antibodies (U.S. Patent No. 4,946,778) can be adapted to produce 



-41- 



WO 97/34997 



PCT/US96/03857 



single chain antibodies to immunogenic polypeptide products of this 
invention. Also, cransg^nic mice, or other organisms such as other 
manmals, may be used to express humanized antibodies to immunogenic 
polypeptide products of this invention. 

The above-described antibodies may be employed to isolate or 
to identify clones expressing the polypeptide or purify the 
polypeptide of the present invention by attachment of the antibody 
to a solid support for isolation and/or purification by affinity 
chr omo t ography . 

Thus, among others, the polynucleotides and polypeptides of 
the present invention may be employed to prevent and/or treat 
inflammation, asthma,, rhinitis, cystic fibrosis, airway disease, 
prevent and/ or treat neoplasia, atopy, inhibit phospholipase A ? , 
bind polychlorated biphenyls, reduce foreign protein antigenicity, 
inhibit monocyte and neutrophil chemotaxis and phagocytosis, 
inhibit platelet aggregation, regulate eicosanoid levels in the 
human uterus, control the growth of endometrial cells. 

hBSP I, II and III binding molecules and assays 
This invention also provides a method for identification of 
molecules, such as receptor molecules, that bind hBSP I, II and 
III. Genes encoding proteins that bind hBSP I, II and III. such 
as receptor proteins, can be identified by numerous methods known 
to those of skill in the art, for example, ligand panning and PACS 
sorting. Such methods are described in many laboratory manuals 
such as, for instance. Coligan et al., Current Protocols in 
Immunology 1(2): Chapter 5 (1991). 

For instance, expression cloning may be employed for this 
purpose. To this end polyadenylated UNA is prepared from a cell 
responsive to hBSP I. II and III. a cDNA library is created from 
this RNA, the library is divided into pools and the pools are 
transf ected individually into cells that are not responsive to hBSP 
I, II and III. The transf ected cells then are exposed to labeled 
hBSP I, II and III. (hBSP I. II and III can be labeled by a 
variety of well-known techniques including standard methods of 
radio- iodinat ion or inclusion of a recognition site for a site- 
specific protein kinase.) Following exposure, the cells are fixed 
and binding of cytostatin is determined. These procedures 
conveniently are carried out on glass slides. 
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Pools are identified of cDNA that produced hBSF I, II and III- 
binding cells. Sub-pools are prepared from these positives, 
transfected into host cells and screened as described above. Using 
an iterative sub-pooling and re-screening process, one or more 
single clones that encode the putative binding molecule, such as 
a receptor molecule, can be isolated. 

Alternatively a labeled ligand can be photoaf f inity linked to 
a cell extract, such as a membrane or a membrane extract, prepared 
from cells that express a molecule that it binds, such as a 
receptor molecule. Cross-linked material is resolved by 
polyacrylamide gel electrophoresis ("PAGE") and exposed to X-ray 
film. The labeled complex containing the ligand- receptor can be 
excised, resolved into peptide fragments, and subjected to protein 
micro sequencing. The amino acid sequence obtained from 
microsequencing can be used to design unique or degenerate 
oligonucleotide probes to screen cDNA libraries to identify genes 
encoding the putative receptor molecule. 

Polypeptides of the invention also can be used to assess hBSP 
I, II and III binding capacity of hBSF I , II and III binding 
molecules, such as receptor molecules, in cells or in cell-free 
preparations. 

Agonists and antagonists - assays and molecules 
The invention also provides a method of screening compounds 
to identify those which enhance or block the action of hBSP I, II 
and III on cells, such as its interaction with hBSP I, II and Ill- 
binding molecules such as receptor molecules. An agonist is a 
compound which increases the natural biological functions of hBSP 
I, II and III or which functions in a manner similar to hBSP I, II 
and III, while antagonists decrease or eliminate such functions. 

Por example, a cellular compartment, such as a membrane or a 
preparation thereof, such a6 a membrane -preparation, may be 
prepared from a cell that expresses a molecule that binds hBSP I, 
II and III, such as a molecule of a signaling or regulatory pathway 
modulated by hBSP I, II and III. The preparation is incubated with 
labeled hBSF I, II and III in the absence or the presence of a 
candidate molecule which may be a hBSP I, II and III agonist or 
antagonist. The ability of the candidate molecule to bind the 
binding molecule is reflected in decreased binding of the labeled 
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Vigand Molecules which hind gratuitously, i.e., without inducing 
the effects of hESF I. -II and III on binding the hBSP I. II and III 
binding molecule, are most likely to be good antagonists. 
Molecules that bind well and elicit effects that are the same as 
or closely related to hBSP I. II and III are agonists. 

hBSP I II and Ill-like effects of potential agonists and 
antagonists 'may by measured, for instance, by determining activity 
of a second messenger system following interaction of the candidate 
molecule with a cell or appropriate cell preparation, and comparing 
the effect with that of hESF I. II and III or molecules that elicit 
the same effects as hBSP I. II and III. Second messenger systems 
that may be useful in this regard include but are not limited to 
AMP guanylate cyclase, ion channel or phosphoinositide hydrolysis 
second messenger systems. 

Another example of an assay for hBSP I , II and III antagonists 
is a competitive assay that combines hBSF I, II and III and a 
potential antagonist with membrane -bound hBSF X. IX and III 
receptor molecules or recombinant hBSP I. II a^id III receptor 
molecules under appropriate conditions for a competitive inhibition 
assay. . hBSF I. II and III can be labeled, such as by 
radioactivity, such that the number of hBSP I, II and III molecules 
bound to a receptor molecule can be determined accurately to assess 
the effectiveness of the potential antagonist. 

Potential antagonists include small organic molecules, 
peptides, polypeptides and antibodies that bind to a polypeptide 
of the invention and thereby inhibit or extinguish its activity. 
Potential antagonists also may be small organic molecules, a 
peptide, a polypeptide such as a closely related protein or 
antibody that binds the same sites on a binding molecule, such as 
a receptor molecule, without inducing hBSF I. II — i"" 1 "* 6 * 
activities, thereby preventing the action of hBSF I. II and XXX by 
excluding hBSP I, II and III from binding. 

Potential antagonists include a small molecule which binds to 
and occupies the binding site of the polypeptide thereby preventing 
binding to cellular binding molecules, such as receptor molecules 
such that normal biological activity is prevented. Examples of 
small molecules include but are not limited to small organic 
molecules, peptides or peptide- like molecules. 
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Other potential antagonists include antisense molecules. 
Antisense technology can be used to control gene expression through 
anticens^ DNA or RNA or through triple-helix formation. Antisense 
techniques are discussed, for example, in - Okano, J. Neurochem. 
56: 560 (1991); OL I GODEOX YNUCLEOTI DBS AS ANTISENSE INHIBITORS OF 
GENE EXPRESSION, CRC Press, Boca Raton, PL (1988) . Triple helix 
formation is discussed in, for instance Lee et al., Nucleic Acids 
Research 6: 3073 (1979); Cooney et al . , Science 241: 456 (1988); 
and Dervan et al. , Science 251: 1360 (1991) . The methods are based 
on binding of a polynucleotide to a complementary DNA or RNA. For 
example, the 5' coding portion of a polynucleotide that encodes the 
mature polypeptide of the present invention may be used to design 
an antisense RNA oligonucleotide of from about 10 to 40 base pairs 
in length. A DNA oligonucleotide is designed to be complementary 
to a region of the gene involved in transcription thereby 
preventing transcription and the production of hESF I, II and III. 
The antisense RNA oligonucleotide hybridizes to the mRNA in vivo 
and blocks translation of the mRNA molecule into hESF I, II and III 
polypeptide. The oligonucleotides described above can also be 
delivered to cells such that the antisense RNA or DNA may be 
expressed in vivo to inhibit production of hESF I, II and III. 

The antagonists may be employed in a composition with a 
pharmaceutically acceptable carrier, e.g., as hereinafter 
described . 

The antagonists may be employed for instance to treat an 
inherited susceptibility to asthma. 

Compositions 

The invention also relates to conqpositions comprising the 
polynucleotide or the polypeptides discussed above or the agonists 
or antagonists. Thus, the polypeptides of the present invention 
may be employed in combination with a non-sterile or sterile 
carrier or carriers for use with cells, tissues or organisms, such 
as a pharmaceutical carrier suitable for administration to a 
subject. Such compositions comprise, for instance, a media 
additive or a therapeutically effective amount of a polypeptide of 
the invention and a pharmaceutically acceptable carrier or 
excipient. Such carriers may include, but are not limited to, 
saline, buffered saline, dextrose, water, glycerol, ethanol and 
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.combinations thereof. The formulation should suit the mode of 
administration. 

Kits 

The invention further relates to pharmaceutical packs and kits 
comprising one or more containers filled with one or more of the 
ingredients of the aforementioned compositions of the invention. 
Associated with such container (s) can be a notice in the form 
prescribed by a governmental agency regulating the manufacture, use 
or sale of pharmaceuticals or biological products, reflecting 
approval by the agency of the manufacture, use or sale of the 
product for human administration. 



Administration 

Polypeptides and other compounds of the present invention may 
be employed alone or in conjunction with other compounds, such as 
therapeutic compounds. 

The pharmaceutical compositions may be adn^inistered in any 
effective, convenient manner including, 1 for instance, 
administration by topical, oral, anal, vaginal, intravenous, 
intraperitoneal , intramuscular , subcutaneous , intranasal or 
intradermal routes among others. 

The pharmaceutical compositions generally are administered in 
an amount effective for treatment or prophylaxis of a specific 
indication or indications. In general, the compositions are 
administered in an amount of at least about 10 pg/Ks bod y weight. 
In most cases they will be administered in an amount not in excess 
of about 8 mg/kg body weight per day. Preferably, in most cases, 
dose is from about 10 m9/*9 to about 1 mg/kg body weight, daily. 
It will be appreciated that optimum dosage will be determined by 
standard methods for each treatment modality and indication, taking 
into account the indication, its severity, route of administration, 
complicating conditions and the like. 



Gene therapy 

The hBSF I, II and III polynucleotides, polypeptides, agonists 
and antagonists that are polypeptides may be employed in accordance 
with the present invention by expression of such polypeptides in 
vivo, in treatment modalities often referred to as "gene therapy. " 
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Thus, for example, cells from a patient may be engineered with 
a polynucleotide, such as a DNA or RNA, encoding a polypeptide rv 
vivo, and the engineered cells then can be provided to a patient 
to be treated with the polypeptide. For example, cells may be 
engineered ex vivo by the use of a retroviral plasmid vector 
containing RNA encoding a polypeptide of the present invention. 
Such methods are well-known in the art and their use in the piresent 
invention will be apparent from the teachings herein. 

Similarly, cells may be engineered in vivo for expression of 
a polypeptide in vivo by procedures Known in the art. For example, 
a polynucleotide of the invention may be engineered for expression 
in a replication defective retroviral vector, as discussed above. 
The retroviral expression construct then may be isolated and 
introduced into a packaging cell is transduced with a retroviral 
plasmid vector containing RNA encoding a polypeptide of the present 
invention such that the packaging cell now produces infectious 
viral particles containing the gene of interest. These producer 
cells may be administered to a patient for engineering cells in 
vivo and expression of the polypeptide in vivo. These and other 
methods for administering a polypeptide of the present invention 
by such method should be apparent to those skilled in the art from 
the teachings of the present invention. 

Retroviruses from which the retroviral plasmid vectors herein 
above mentioned may be derived include, but are not limited to, 
Moloney Murine Leukemia Virus, spleen necrosis virus, retroviruses 
such as Rous Sarcoma Virus, Harvey Sarcoma Virus, avian leukosis 
virus, gibbon ape leukemia virus, human inrounodef iciency virus, 
adenovirus, Myeloproliferative Sarcoma Virus, and maimnary tumor 
virus. In one embodiment, the retroviral plasmid vector is derived 
from Moloney Murine Leukemia Virus. 

Such vectors well include one or more promoters for expressing 
the polypeptide. Suitable promoters which may be employed include, 
but are not limited to, the retroviral LTR; the SV40 promoter; and 
the human cytomegalovirus (CMV) promoter described in Miller et 
al., Biotechniques 7: 980-990 (1989), or any other promoter (e.g., 
cellular promoters such as eukaryotic cellular promoters including, 
but not limited to, the histone, RNA polymerase III, and B-actin 
promoters) . Other viral promoters which may be employed include, 
but are not limited to, adenovirus promoters, thymidine kinase (TK) 
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promoters, and Bl 9 parvovirus promoters. The selection of a 
suitable promoter will be apparent to those skilled in the art from 
the teachings contained herein. 

The nucleic, acid sequence encoding the polypeptide of the 
present invention will be placed under the control of a suitable 
promoter. Suitable promoters which may be employed include, but 
are not limited to. adenoviral promoters, such as the adenoviral 
major late promoter; or heterologous promoters, such as the 
cytomegalovirus (CMV) promoter; the respiratory syncytial virus 
(RSV) promoter; inducible promoters, such as the MMT promoter, the 
metallothionein promoter.- heat shock promoters; the albumin 
promoter; the ApoAI. promoter; human globin promoters; viral 
thymidine kinase promoters, such as the Herpes Simplex thymidine 
kinase promoter; retroviral LTRs (including the modified retroviral 
LTRs herein above described); the B-actin promoter; and human 
growth hormone promoters. The promoter also may be the native 
promoter which controls the gene encoding the polypeptide. 

The retroviral plasmid vector is employed to transduce 
packaging cell lines to form producer cell lines. Bxamples of 
packaging cells which may be transfected include, but are not 
limited to. the PB501, PA317, Y-2, Y -am, PA12, T19-14X. VT-19-17- 
H2 YCRE, YCRIP. GP-B-86, GP-envAral2, and DAN cell lines as 
described in Miller, A.. Human Gene Therapy 1: 5-14 (1990). The 
vector may be transduced into the packaging cells through any means 
known in the art. Such means include, but are not limited to. 
electroporation. the use of liposomes, and CaP04 precipitation, 
in one alternative, the retroviral plasmid vector may be 
encapsulated into a liposome, or coupled to a lipid, and then 

administered to a hose. 

The producer cell line will generate infectious retroviral 
vector particles, which include the nucleic acid sequence (s) 
encoding the polypeptides. Such retroviral vector particles then 
may be employed to transduce eukaryotic cells, either in vitro 
in vivo. The transduced eukaryotic cells will express the nucleic 
acid sequence (s) encoding the polypeptide. Eukaryotic cells which 
may be transduced include, but are not linr ted to. embryonxc stem 
cells, embryonic carcinoma cells, as well as hematopoietic stem 
cells. hepatocytes, fibroblasts. myoblasts. keratinocytes. 
endothelial cells, and bronchial epithelial cells. 
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EXAMPLES 

The present invention is further described by the following 
examples. Tl\e examples are provided solely to illustrate the 
invention by reference to specific embodiments. These 
exemplification's, while illustrating certain specific aspects of 
the invention, do not portray the limitations or circumscribe the 
scope of the disclosed invention. 

Certain terms used herein are explained in the foregoing 
glossary. 

All examples were carried out using standard techniques, which 
are well known and routine to those of skill in the art, except 
where otherwise described in detail. Routine molecular biology 
techniques of the following examples can be carried out as 
described in standard laboratory manuals, such as Sambrook et al., 
MOLECULAR CLONING: A LABORATORY MANUAL, 2nd Ed. ; Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, N.Y. (1989), herein referred 
to as "Sambrook. " 

All parts or amounts set out in the following exatqples are by 
weight, unless otherwise specified. 

Unless otherwise stated size separation of fragments in the 
examples below was carried out using standard techniques of agarose 
and polyacrylamide gel electrophoresis ("PAGE") in Sambrook and 
numerous other references such as, for instance, by Goeddel et al.. 
Nucleic Acids Res. 8: 4057 (1980). 

Unless described otherwise, ligations were accomplished using 
standard buffers, incubation temperatures and times, approximately 
equimolar amounts of the DNA fragments to be ligated and 
approximately 10 units of T4 DNA ligase ("ligase") per 0.5 fig of 
DNA. 

Example 1 Expression and purification of human hBSF I, II and III 
using bacteria 

The DNA sequence encoding human hESP I, II or III in the 
deposited polynucleotide was amplified using PCR oligonucleotide 
primers specific to the amino acid carboxyl terminal sequence of 
the human hESF I, II or III protein and to vector sequences 3' to 
the gene. Additional nucleotides containing restriction sites to 
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facilitate cloning were added to the 5' and 3' sequences 

respectively. 

The 5' oligonucleotide primer had the sequence 

for hESF I: 5' CGC GCATGCT TGTCTGCCCAGCTG 3' (SEQ ID NO: 7) 
containing the underlined Sph I restriction site, which encodes a 
start AUG, followed by 15 nucleotides of the human hESF I coding 
sequence set out in Figure 1 (SEQ ID N0:1), beginning with the 
first base of the codon for amino acid 22 (leucine) . 

for hESF II: 5' CGCCCATGGAGTTCTGCCCAGCTC 3' (SEQ ID NO: 8) 
containing the underlined Ncol restriction site, which encodes a 
start ATG, followed by 16 nucleotides of the human hESF II coding 
sequence set out in Figure 2 (SEQ ID N0:3), beginning with the 
first base of the codon for amino acid 22. 

for hESF III: 5' CGC GCA TGC ACT GCT ATG CAG ATT 3' (SEQ ID 
NO: 9) containing the underlined SphI restriction site, which 
encodes a start ATG, followed by 16 nucleotides of the human hESF 
HI coding sequence set out in Figure 3 (SEQ ID NO: 5) . 

The 3' primer has the sequence i 

for hESF I 5' CG CAAGCTT C A'ri'1'l'rA CATGTCA 3' (SEQ ID NO: 10) 
containing the underlined Hind III restriction site followed by 15 
nucleotides complementary to 15 nucleotides of the hESF I non- 
coding sequence set out in Figure 1 (SEQ ID NO:l), including the 
stop codon. 

for hESF II 5' CG CAAGCTTA G' rmTA CATGTCA 3' (SEQ ID NO:ll) 
containing the underlined Hind III restriction site followed by 15 
nucleotides complementary to the last 15 nucleotides of the hESF 
II non-coding sequence set out in Figure 2 (SEQ ID NO: 3) , including 
the stop codon. 

for hESF III 5 ' CGC AAG CTT ACG CCT TGG GTA AAG TTA 3' (SEQ 
ID NO: 12) containing the underlined Hindlll restriction site 
followed by 18 nucleotides complementary to hESF III non-coding 
sequence set out in Figure 3 (SEQ ID NO: 5) , including the stop 
codon . 

The restrictions sites were convenient to restriction enzyme 
sites in the bacterial expression vectors pQB-60 (hESF I and II) , 
(Qiagen, Inc.) which were used for bacterial expression in these 
examples. (Qiagen, Inc. Chatsworth, CA) . pQB-60 encodes 
ampicillin antibiotic resistance ("Arnpr") and contains a bacterial 
origin of replication ("ori"), an IPTG inducible promoter, a 
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ribosorae binding site ( "RBS n ) , a 6 -His taj and restriction enzyme 

sites . 

The amplified human hESF I, II and III DNA and the vector pQE- 
60 both were digested with Sph I and Hind III (hESF I), Nco I and 
Hindi II (hESF II) and SphI and Hindi 1 1 (hESF III) and the digested 
DNAs then were ligated together. Insertion of the hESF I DNA into 
the restricted vector placed the respective coding regions 
downstream of and operably linked to the vector's IPTG- inducible 
promoter and in- frame with an initiating AUG appropriately 
positioned for translation of hESF I, II and III. 

The ligation mixture was transformed into competent E. coli 
cells using standard procedures. Such procedures are described in 
Sambrook et al., MOLECULAR CLONING: A LABORATORY MANUAL, 2nd Bd. ; 
Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. 
(1989) . B. coli strain Ml5/rep4, containing multiple copies of the 
plasmid pREP4 , which expresses lac repressor and. confers kanaraycin 
resistance ("Kanr"), was used in carrying out the illustrative 
example described here. This strain, which is only one of many 
that are suitable for expressing hESF I, II and III is available 
commercially from Qiagen. 

Trans f ormants were identified by their ability to grow on LB 
plates in the presence of ampicillin. Plasmid DNA was isolated 
from resistant colonies and the identity of the cloned DNA was 
confirmed by restriction analysis. 

Clones containing the desired constructs were grown overnight 
("O/N") in liquid culture in LB media supplemented with both 
ampicillin (100 ug/ml) and kanamycin (25 ug/ml) . 

The 0/N culture was used to inoculate a large culture, at a 
dilution of approximately 1:100 to 1:250. The cells were grown to 
an optical density at 600nm < n OD600 n ) of between 0.4 and 0.6. 
isopropyl-B-D-thiogalactopyranoside ( tt IPTG w ) was then added to a 
final concentration of 1 mM to induce transcription from lac 
repressor sensitive promoters, by inactivating the lad repressor. 
Cells subsequently were incubated further for 3 to 4 hours. Cells 
then were harvested by centrif ugation and disrupted, by standard 
methods. Inclusion bodies were purified from the disrupted cells 
using routine collection techniques, and rrotein was solubilized 
from the inclusion bodies into 8M urea. The 8M urea solution 
containing the solubilized protein was passed over a PD-10 column 
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ir -X phosphate buffered saline CPT3-) . thereby removing the urea, 
exchanging the buffer an," refolding 'he protein. The protein was 
purified by a further step of chromatography to remove endotoxin. 
Then, it was sterile filtered. The sterile filtered protein 
preparation was stored in 2X PBS at a concentration of 95 

micrograms per mL. 

Analysis of the preparation by standard methods of 
polyacrylamide gel electrophoresis revealed that the preparation 
contained about 95% monomer hESF I. II and III having the expected 
molecular weight. 

Bxample 2 Cloning and expression of human hESF I , II and III in a 
baculovirus expression system 

The cDNA sequence encoding the full length human hBSF I, II 
and III protein, in the deposited clone is amplified using PCR 
oligonucleotide primers corresponding to the 5' and 3' sequences 

of the gene: 

for hESF I the 5' primer has the sequence 5' CCCOOATCC 
GCCATCAT^AGGCTGTCAGTGTGTCT 3' (SBQ ID NO: 13) containing the BamHI 
restriction enzyme site (bold) followed by a kozak sequence (GCC 
ATC) and 20 bases of the sequence of hBSF I of Figure 1 (SBQ ID 

^^'for hESF II the 5' primer has the sequence 5' CGC GGA TCC GCC 
ATC ATG AAG CTG TCG GTG 3' (SBQ ID N0:14) containing the BamHI 
restriction enzyme site (bold) followed by 15 bases of the sequence 
of hESF II of Figure 2 (SBQ ID NO:3) ; 

for hBSF III the 5' primer has the sequence 5' CGC GJ5n_I£C GCC 
ATC ATG AAG CTG CTG A.TG GTC 3' (SBQ ID NO: IS) containing the BamHI 
restriction enzyme site (bold) followed by IS bases of the sequence 
of hBSF III of Figure 3 (SBQ ID NO:S) . 

inserted into an expression vector, as described below, the 
5' end of the amplified fragment encoding human hBSF I. II or III 
provides an efficient signal peptide. An efficient signal for 
initiation of translation in eukaryotic cells, as described by 
Kozak. M. , J . Mol. Biol. 196: 947-950 (1987, is appropriately 
located in the vector portion of the construct. 

For hBSF I the 3' primer has the sequence 5' CCCGGTACC 
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'J " '.TIT ITI'IT ' TITIT r V 3' (SBQ ID NO: 16) containing the underlined 
Asp7i3 restriction site followed by ±8 nucleotides complementary 
to the poly A tail; 

for hESP II the 3' primer has the sequence 5' CGC 
GGTATCACX3CCTTGGGTAAAGTTA 3' (SBQ ID NO: 17) containing the 
underlined Asp7l8 restriction followed by nucleotides complementary 
to 15 nucleotides of the hBSF II non-coding sequence set out in 
Figure 2 (SBQ ID NO:3), including the stop codon; 

for hESF III the 3' primer has the sequence 5' CGC GGT ACC ACG 
CCT TGG GTA AAG TTA 3' (SBQ ID NO: 18) containing the underlined 
Asp7l8 restriction followed by nucleotides complementary to 18 
nucleotides of the hESF III non-coding sequence set out in Figure 
3 (SBQ ID N0:5), including the stop codon. 

The amplified fragments are isolated from a 1% agarose gel 
using a cotmnercially available kit ("Geneclean, " BIO 101 Inc., La 
Jolla, Ca.) . The fragments then are digested with the respective 
restriction enzymes and again are purified on a 1% agarose gel. 
This fragments are designated herein F2 . 

The vector pRGl is used to express the hESF I, II or III 
protein in the baculo virus expression system, using standard 
methods, such as those described in Summers et al, A MANUAL OF 
METHODS FOR BACULO VIRUS VECTORS AND INSECT CELL CULTURE PROCEDURES , 
Texas Agricultural Experimental Station Bulletin No. 1555 (1987) . 
This expression vector contains the strong polyhedrin promoter of 
the Autographa calif ornica nuclear polyhidrosis virus (AcMNPV) 
followed by convenient restriction sites. The polyadenylation site 
of the simian virus 40 C f SV40") is used for efficient 
polyadenylation. For an easy selection of recombinant virus the 
beta-galactosidase gene from E.coli is inserted in the same 
orientation as the polyhedrin promoter and is followed by the 
polyadenylation signal of the polyhedrin gene. The polyhedrin 
sequences are flanked at both sides by viral sequences for cell- 
mediated homologous recombination with wild- type viral DNA to 
generate viable virus that express the cloned polynucleotide. 

Many other baculovirus vectors could be used in place of pA2, 
such as pAc373, pVL94l and pAcIMl provided, as those of skill 
readily will appreciate, that construction provides appropriately 
located signals for transcription, translation, trafficking and the 
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. like. Such vectors are described in Luckow et al . . Virology 170: 

31-3 9. among others. 

The plasmid is digested with the respective restriction 
enzymes and then is dephosphorylated using calf intestinal 
phosphatase, using routine procedures known in the art. The DNA 
is then isolated from a 1% agarose gel using a commercially 
available kit ("Geneclean" BIO 101 Inc.. La Jolla, CaJ. This 
vector DNA is designated herein "V2". 

Fragments F2 and the dephosphorylated plasmid V2 are ligated 
together with T4 DNA ligase. B.coli HB101 cells are transformed 
with ligation mix and spread on culture plates. Bacteria are 
identified that contain the plasmid with the human hESF I, II or 
III gene by digesting DNA from individual colonies using the 
respective restriction enzymes and then analyzing the digestion 
product by gel electrophoresis. The sequence of the cloned 
fragment is confirmed by DNA sequencing. This plasmid is 
designated herein pBachBSF I, II or III. 

5 M9 of the plasmid pBachBSF I. II or III is co-transf ected 
with 1.0 ^9 of a commercially available linearized baculovirus DNA 
(■BaculoGold™ baculovirus DNA", Pharmingen. San Diego, CA. ) . using 
the lipofection method described by Feigner et al.. Proc. Natl. 
Acad. Sci. USA 84: 7413-7417 (1987) . l M g of BaculoGold" virus DNA 
and 5 M g =f the plasmid pBachBSF I, II or III are mixed in a 
sterile well of a microtiter plate containing 50 til of serum free 
Grace's medium (Life Technologies Inc., Gaithersburg. MD) . 
Afterwards 10 nl Lipofectin plus 90 pi Grace's medium are added, 
mixed and incubated for 15 minutes at room temperature. Then the 
transfection mixture is added drop-wise to Sf9 insect cells (ATCC 
CRL 1711) seeded in a 35 mm tissue culture plate with 1 ml Grace's 
medium without serum. The plate is rocked back and forth to mix 
the newly added solution. The plate is then incubated for 5 hours 
at 27 *C. After 5 hours the transfection solution is removed from 
the plate and l ml of Grace's insect medium supplemented with 10% 
fetal calf serum is added. The plate is put back into an incubator 
and cultivation is continued at 27 *C for four days. 

After four days the supernatant is collected and a plaque 
assay is performed, as described by Summers and Smith, cited above. 
An agarose gel with "Blue Gal" (Life Technologies Inc., 
Gaithersburg) is used to allow easy identification and isolation 
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of gal -expressing clones, which produce blue-stained plaques. (A 
detailed description of a "plaque assay" of this type can also be 
found in the user's guide for insect cell culture cnH 
baculovirology distributed by Life Technologies Inc. , Gaithersburg, 
page 9-10) . 

Pour days after serial dilution, the virus is added to the 
cells. After appropriate incubation, blue stained plaques axe 
picked with the tip of an Eppendorf pipette. The agar containing 
the recombinant viruses is then resuspended in an Eppendorf tube 
containing 200 fil of Grace's medium. The agar is removed by a 
brief centrif ugation and the supernatant containing the recombinant 
baculovirus is used to infect Sf9 cells seeded in 35 mm dishes. 
Four days later the supernatant s of these culture dishes are 
harvested and then they are stored at 4*C. A clone containing 
properly inserted hBSP I, II or III is identified by DNA analysis 
including restriction mapping and sequencing. ( This is designated 
herein as V-hESF I, II or III. r 

Sf9 cells are grown in Grace's medium supplemented with 10% 
heat -inactivated FBS. The cells are infected with the recombinant 
baculovirus V-hBSF I, II or III at a multiplicity of infection 
("MOI") of about 2. Six hours later the medium is removed and is 
replaced with SF900 II medium minus methionine and cysteine 
(available from Life Technologies Inc., Gaithersburg) . 42 hours 
later, 5 pCi of 35S -methionine and 5 fid 35S cysteine (available 
from Amersham) are added. The cells are further incubated for 16 
hours and then they are harvested by centrif ugation, lysed and the 
labeled proteins are visualized by SDS-PAGE and autoradiography. 

Example 3 Expression of hBSP I, II and III in COS cells 

The expression plasnud. hBSP l ( n and III HA, is made by 
cloning a cDNA encoding hBSF I, II and III into the expression 
vector pcDNAI/Amp (which can be obtained from Invitrogen, Inc.). 

The expression vector pcDNAI/atnp contains: (1) an E.coli 
origin of replication effective for propagation in E. coli and 
other prokaryotic cell; (2) an ampicillin resistance gene for 
selection of plasmid-containing prokaryotic cells; (3) an SV40 
origin of replication for propagation in eukaryotic cells; (4) a 



-55- 



W0 97,34*>7 PCT/US96/03857 

CMV promoter, a polylinker. an SV40 intron. and a polyadenylation 
signal arranged so that a cDNA conveniently can be placed under 
expression control of the W promoter ana operably linked to the 
SV40 intron and the polyadenylation signal by means of restriction 
sites in the polylinker. 

A DNA fragment encoding the entire hESF I. II and III 
precursor and a HA tag fused in frame to its 3' end is cloned into 
the polylinker region of the vector so that recombinant protein 
expression is directed by the CMV promoter. The HA tag corresponds 
to an epitope derived from the influenza hemagglutinin protein 
described by Wilson et al. ( Cell 37: 767 (1984) . The fusion of the 
HA tag to the target protein allows easy detection of the 
recombinant protein with an antibody that recognizes the HA 
epitope . 

The plasmid construction strategy is as follows: 

The hBSP I, II and III cDNA of the deposit clone is amplified 
using primers that contained convenient restriction sites, much as 
described above regarding the construction of expression vectors 
for expression of hESF I. II and III in B. coli and S. fugiperda. 

To facilitate detection, purification and characterization of 
the expressed hBSF I, II and III. one of the primers contains a 
heaniaglutinin tag ("HA tag") as described above. 

Suitable primers include that following, which are used m 

this example: 

The 5' primer, containing the underlined BamHI site, an AUG 
start codon and has the following sequence. 5' CGC GGA TCC ACC ATG 
GTC TCG CTG GCC CTT 3' (SBQ ID N0:19) (ESFI) ,• 5' CGC GGA TCC ACC 
ATG AAG CTG TCG GTG TGT 3' (SEQ ID NO:20) (BSFII) ; 5 ' CGC GGA TCC 
ACC ATG AAG CTG CTG ATG GTC 3 ' (SBQ ID N0:21) (BSFIII) . 

The 3' primer, containing the underlined Xbal site, stop 
codon, HA tag and 15 bp of 3' coding sequence (at the 3' end) has 
the following sequence: 

5' CGC TCT *GA TCA AGC GTA GTC TGG GAC GTC GTA TGG GTA CAC ACC ACA 
TTT TTT 3 (SBQ ID NO:22) (BSFI) ; 5' CGC IS^AGA TCA AGC GTA GTC 
TGG GAC GTC GTA TGG GTA CAC ACT ACA TTT CTT 3' (SEQ ID NO:23> 

(BSFII) 5' CGC TCT AGA TCA AGC GTA GTC TGG GAC GTC GTA TGG GTA ATT 
ACT CTT CAT ATT 3 ' (SEQ ID NO: 24) (BSFIII) 

The PCR amplified DNA fragment and the vector, pcDNAI/Amp, are 

digested with and then ligated. The ligation mixture is 
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transformed into B. coli strain SURE (available from Stratajene 
Cloning Systems, 11099 North Torrey Pines Road, La Jolla, CA 92037) 
the transformed culture is plated on ampicillin media plates which 
then are incubated to allow growth of ampicillin resistant 
colonies. Plasmid DNA is isolated from resistant colonies and 
examined by restriction analysis and gel sizing for the presence 
of the hBSF I, II and III -encoding fragment . 

For expression of recombinant h£SF I, II and III, COS cells 
are transfected with an expression vector, as described above, 
using DEAB-DEXTRAN, as described, for instance, in Sambrook et al., 
MOLECULAR CLONING: A LABORATORY MANUAL, Cold Spring Laboratory 
Press, Cold Spring Harbor, New York (1989) . Cells are 

incubated under conditions for expression of hBSF I, II and III by 
the vector. 

Expression of the hESF I, II and III HA fusion protein is 
detected by radiolabelling and iraraunoprecipitation, uBing methods 
described in, for example Harlow et al., ANTIBODIES: A LABORATORY 
MANUAL, 2nd Ed.; Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, New York (1988) . To this end, two days after transf ection # 
the cells are labeled by incubation in media containing 35S- 
cysteine for 8 hours. The cells and the media are collected, and 
the cells are washed and the lysed with detergent -containing RIPA 
buffer: 150 mM NaCl, 1% NP-40, 0.1% SDS, 1% NP-40, 0.5% DOC, 50 tnM 
TRIS, pH 7.5, as described by Wilson et al. cited above. Proteins 
are precipitated from the cell lysate and from the culture media 
using an HA- specific monoclonal antibody. The precipitated 
proteins then are analyzed by SDS-PAGE gels and autoradiography. 
An expression product of the expected size is seen in the cell 
lysate, which is not seen in negative controls. 

Example 4 Tissue distribution of hESF I, II and III expression 

Northern blot analysis is carried out to examine the levels 
of expression of hESF I, II and III in human tissues, using methods 
described by, among others, Sambrook et al, cited above. Total 
cellular RNA samples are isolated with RNAzol™ B system (Biotecx 
Laboratories, Inc. 6023 South Loop East, Houston, TX 77033) . 

About lOjig of Total RNA is isolated from tissue samples. The 
RNA is size resolved by electrophoresis through a 1% agarose gel 



-57- 



WO 97/34997 



PCT/US96/03857 



'■under strongl/ denaturing conditions. RNA is blotted from the gel 
onto i nylon filter, and the filter then is prepared for 
hybridization to a detectably labeled polynucleotide probe. 

As a probe to detect mRNA that encodes hBSF I, II and III, the 
antisense strand of the coding region of the cDNA insert in the 
deposited clone is labeled to a high specific activity. The cDNA 
is labeled by primer extension, using the Prime-It kit, available 
from Stratagene. The reaction is carried out using 50 ng of the 
cDNA, following the standard reaction protocol as recommended by 
the supplier. The labeled polynucleotide is purified away from 
other labeled reaction components by column chromatography using 
a Select -G- 50 column, obtained from 5 -Prime - 3 -Prime, Inc. of 5603 
Arapahoe Road, Boulder. CO 80303. 

The labeled probe is hybridized to the filter, at a 
concentration of l. 000. 000 cpm/ml, in a small volume of 7% SDS, 0.5 
M NaP04, pH 7.4 at 65* C. overnight. 

Thereafter the probe solution is drained and the filter is 
washed twice at room temperature and twice at 60 with 0.5 x SSC. 
0.1% SDS. The filter then is dried and exposed to film at -70*C 
overnight with an intensifying screen. 



Example 5 Gene therapeutic expression of human hBSF I. 



II and III 



Fibroblasts are obtained from a subject by skin biopsy. The 
resulting tissue is placed in tissue-culture medium and separated 
into small pieces. Small chunks of the tissue are placed on a wet 
surface of a tissue culture flask, approximately ten pieces are 
placed in each flask. The flask is turned upside down, closed 
tight and left at room temperature overnight. After 24 hours at 
room temperature, the flask is inverted - the chunks of tissue 
remain fixed to the bottom of the flask - and fresh media is added 
(e.g., Ham's F12 media, with 10% FBS, penicillin and streptomycin). 
The tissue is then incubated at 37 *C for approximately one week. 
At this time, fresh media is added and subsequently changed every 
several days. After an additional two weeks in culture, a 
monolayer of fibroblasts emerges . The monolayer is trypsinized and 
scaled into larger flasks. 

A vector for gene therapy is digested with restriction enzymes 
for cloning a fragment to be expressed. The digested vector is 
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treatertfwith calf intestinal phosphatase to prevent self - ligation . 
The dephosphorylated, linear vector is fractionated on an agarose 
gel and purified. 

hESF I, II and III cDNA cap^le of expressing active hESF I, 
II and III, is isolated. The ends of the fragment are modified, 
if necessary, for cloning into the vector. For instance, 5 n 
overhanging may be treated with DNA polymerase to create blunt 
ends. 3.' overhanging ends may be removed using SI nuclease. 
Linkers may be ligated to blunt ends with T4 DNA ligase. 

Equal quantities of the Moloney murine leukemia virus linear 
backbone and the hESF I, II or III fragment are mixed together and 
joined using T4 DNA ligase. The ligation mixture is u$ed to 
transform E. Coli and the bacteria are then plated onto agar- 
containing kanamycin. Kanamycin phenotype and restriction analysis 
confirm that the vector has the properly inserted gene. 

Packaging cells are grown in tissue culture to confluent 
density in Dulbecco's Modified Eagles Medium (D^JEM) with 10% calf 
serum (CS) , penicillin and streptomycin. The vector containing the 
hESF I, II or III gene is introduced into the packaging cells by 
standard techniques. Infectious viral particles containing the hESF 
I, II or III gene are collected from the packaging cells, which now 
are called producer cells. 

Fresh media is added to the producer cells, and after an 
appropriate incubation period media is harvested from the plates 
of confluent producer cells. The media, containing the infectious 
viral particles, is filtered through a Millipore filter to remove 
detached producer cells. The filtered media then is used to infect 
fibroblast cells. Media is removed from a sub-confluent plate of 
fibroblasts and quickly replaced with the filtered media. 
Polybrene (Aldrich) may be included in the media to facilitate 
transduction. After appropriate incubation, the media is removed 
and replaced with fresh media. If the titer of virus is high, then 
virtually all fibroblasts will be infected and no selection is 
required. If the titer is low, then it is necessary to use a 
retroviral vector that has a selectable marker, such as neo or his, 
to select out transduced cells for expansion. 

Engineered fibroblasts then may be injected into rats, either 
alone or after having been grown to confluence on microcarrier 
beads, such as cytodex 3 beads. The injected fibroblasts produce 
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hBSF I, II or III product, and the biological actions of the 
protein are conveyed to the host. 

It will be clear that the invention may be practiced otherwise 
than as particularly described in the foregoing description and 
examples . 

Numerous modifications and variations of the present invention 
are possible in light of the above teachings and, therefore, are 
within the scope of the appended claims. 
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SEQUENCE LISTING 



(1) GENERAL INFORM, i ?N : 

(i) APPLICANT: Gentz, Reiner 

(ii) TITLE OF INVENTION: HUMAN ENDOMETRIAL SPECIFIC 
STEROID -BINDING FACTOR I, II AND III 

(iii) NUMBER OF SEQUENT S 27 

(iv) CORRESPONDENCE ADDRESS : 

(A) ADDRESSEE: CARELLA, BYRNE, BAIN, GILFILLAN, CECCHI , 

STEVIART & OLSTEIN 

(B) STREET: 6 BECKER FARM ROAD 

(C) CITY: ROS ELAND 

(D) STATE: NEW JERSEY 

(E) COUNTRY: USA 
(P) ZIP: 07068-1739 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patencln Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Perraro, Gregory D 

(B) REGISTRATION NUMBER: 36,134 

(C) REFERENCE /DOCKET NUMBER: 32S800-520 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 201-994-1700 

(B) TELEFAX: 201-994-1744 

(2) INFORMATION FOR SEQ ID NO ; 1 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 433 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 
(D> TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic J 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 43. .312 

(ix) FEATURE: 

(A) NAME/KEY: sigjJepcide 

(B) LOCATION: 43. 105 

(ix) FEATURE: 

(A) NAME /KEY : mat_peptide 

(B) LOCATION: 106 . .312 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO:l: 

TCACTCATTS TGAAAGCTGA GCTCACAGCC GAATAAGCCA CC ATG AGG CTG TCA 54 

Met Arg Leu Ser 
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-21 -20 

GTG TGT CTC CTG ATG GTC TCG CTG GCC CTT TGC TGC TAC CAG GCC CAT 102 
Val Cys Leu Leu Mec Val Ser Leu Ala Leu Cys Cys Tyr Gin Ala His 
-15 -10 -5 

GCT CTT GTC TGC CCA GCT GTT GCT TCT GAG ATC ACA GTC TTC TTA TTC 150 
Ala Leu Val Cys Pro Ala val Ala Ser Glu lie Thr Val Phe Leu Phe 
1 5 10 15 

TTA AGT GAC GCT GCG GTA AAC CTC CAA GTT GCC AAA CTT AAT CCA CCT 198 
Leu Ser Asp Ala Ala Val Asn Leu Gin Val Ala Lys Leu Asn Pro Pro 
20 25 30 

CCA GAA GCT CTT GCA GCC AAG TTG GAA GTG AAG CAC TGC ACC GAT CAG 246 
Pro Glu Ala Leu Ala Ala Lys Leu Glu Val Lys His Cys Thr Asp Gin 
35 40 45 

ATA TCT TTT AAG AAA CGA CTC TCA TTG GAA AAA GTC CTG GTG GAA ATA 294 
lie Ser Phe Lys Lys Arg Leu Ser Leu Glu Lys Val Leu Val Glu He 
50 55 60 

GTG AAA AAA TGT GGT GTG TGACATGTAA AAATGCTCAA CCTGGTTTCC 342 
Val Lys Lys Cys Gly Val 
€5 

AAAGTCTTTC AACGACACCC TGATCTTCAC TAAAAATTGT AAAGGTTTCA ACACGTTGCT 402 

TTAATAAATC ACTTGCCCTG CACATCAAAA A 433 

' 1 
(2) INFORMATION FOR SEQ ID NO: 2: * 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Met Arg Leu Ser Val Cys Leu Leu Met Val Ser Leu Ala Leu Cys Cys 

-21 -20 -15 -10 

Tyr Gin Ala His Ala Leu Val Cys Pro Ala Val Ala Ser Glu lie Thr 

-5 l 5 10 

Val Phe Leu Phe Leu Ser Asp Ala Ala Val Asn Leu Gin Val Ala Lys 
15 20 25 

Leu Asn Pro Pro Pro Glu Ala Leu Ala Ala Lys Leu Glu Val Lys His 
30 35 40 

Cys Thr Asp Gin He Ser Phe Lys Lys Arg Leu Ser Leu Glu Lys Val 
45 50 55 

Leu Val Glu lie val Lys Lys Cys Gly Val 
60 65 

(2) INFORMATION FOR SEQ ID NO: 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 416 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(IX) FEATURE : 

(A) NAME /KEY : CDS 

(B) LOCATION: 40 . J09 

(ix) PEATURE: 

(A) NAME/KEY: sig j>e?~ ide 

(B) LOCATION : 40. " 102 

(ix) FEATURE: 

(A) NAME/KEY; mat^peptide 

(B) LOCATION: 103 . .309 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

TTGTTTGTGA AAGCTGAGCT CACAGCAAAA CAAGCCACC ATG AAG CTG TCG GTG 54 

Met Lys Leu Ser Val 

-21 -20 

TGT CTC CTG CTG GTC ACG CTG GCC CTC TGC TGC TAC CAG GCC AAT GCC 102 
Cys Leu Leu Leu Val Thr Leu Ala Leu Cys Cys Tyr Gin Ala Asn Ala 
-15 -10 -5 

GAG TTC TGC CCA GCT CTT GTT TCT GAG CTG TTA GAC TTC TTC TTC ATT 150 
Glu Phe Cys Pro Ala Leu Val Ser Glu Leu Leu Asp Phe Phe Phe lie 
15 10 is 

AGT GAA CCT CTG TTC AAG TTA AGT CTT GCC AAA TTT GAT GCC CCT CCG 198 
Ser Glu Pro Leu Phe Lys Leu Ser Leu Ala Lys Phe Asp Ala Pro Pro 

20 25 30 \ . 

GAA GCT GTT GCA GCC AAG TTA GGA GTG AAG AGA TGC ACG GAT CAG ATG 246 
Glu Ala Val Ala Ala Lys Leu Gly Val Lys Arg Cys Thr Asp Gin Met 
35 40 45 

TCC CTT CAG AAA CGA AGC CTC ATT GCG GAA GTC CTG GTG AAA ATA TTG 294 
Ser Leu Gin Lys Arg Ser Leu He Ala Glu Val Leu Val Lys He Leu 
50 55 60 

AAG AAA TGT AGT GTG TGACATGTAA AAACTTTCAT CCTGGTTTCC ACTGTCTTTC 349 
Lys Lys Cys ser Val 
65 

AATGACACCC TGATCTTCAC TGCAGAATGT AAAGGTTTCA ACGTCTTGCT TTAATAAATC 4 09 

ACTTGCTCTC CAAAAAAAAA AAAAAAA 436 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE : protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

Met Lys Leu Ser Val Cys Leu Leu Leu Val Thr Leu Ala Leu Cvs Cvs 
-21 -20 -15 -10 

Tyr Gin Ala Asn Ala Glu Phe Cys Pro Ala Leu Val Ser Glu Leu Leu 

~ 5 l 5 io 

Asp Phe Phe Phe He Ser Glu Pro Leu Phe Lys Leu Ser Leu Ala Lys 
IS 20 25 

Phe Asp Ala Pro Pro Glu Ala Val Ala Ala Lys Leu Gly Val Lys Arg 
30 35 40 
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C»z Thr te/j31n Met Ser Leu Gin Lys Arg Ser Leu lie Ala Glu Val 
45 50 55 

Leu Val Lys He Leu Lys Lys Cys Ser Val 

r.O 65 

(2) INFORMATION FOR SEQ ID NO:5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 476 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND ED NESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 46 . .330 

(ix) FEATURE: 

(A) NAME/ KEY : sigjeptide 

(B) LOCATION: 46. .108 

(ix) FEATURE: 

(A) NAME /KEY: mat_j>eptide 

(B) LOCATION: 109.. 330 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO:S: 
ACGAGCTGCC ACGCACGACT GAACACAGAC AGCAGCCGCC TCGCC ATO AAG CTG 

-21 -20 



CTG ATG GTC CTC ATG CTG GCG GCC CTC CTC CTG CAC TGC TAT GCA GAT 
Leu Met val Leu Met Leu Ala Ala Leu Leu Leu His Cys Tyr Ala Asp 

.15 -10 "5 

TCT GGC TGC AAA CTC CTG GAG GAC ATG GTT GAA AAG ACC AXC AAT TCC 
Ser Gly Cys Lys Leu Leu Glu Asp Met Val Glu Lys Thr lie Asn Ser 



G*C ATA TCT ATA CCT GAA TAC AAA GAG CIT CTr CAA GAG TTC ATA GAC 
Sp JI1 S?r lie pS Glu Tyr Lys Glu Leu Leu Gin Glu Phe He Asp 
15 20 25 



AGT GAT GCC GCT GCA GAG GCT ATG GGG AAA TTC AAG CAG TGT TTC CTC 
Sr ™ S All £a Glu Ala Kec Gly Lys Phe Lys Gin Cys Phe Leu 
35 «0 45 

AAC CAG TCA CAT AGA ACT CTG AAA AAC TTT GGA CTG ATG ATG CAT ACA 
Asn Gin Ser His Arg Thr Leu Lys Asn Phe Gly Leu Met Met His Thr 
so ss 60 

GTC TAC GAC AGC ATT TGG TGT AAT ATG AAG AGT AAT TAACTTTACC 
Val Tyr Asp Ser He Trp Cys Asn Met Lys Ser Asn 

65 70 
CAAGGCGTTT GGCTCAGAGG GCTACAGACT ATGGCCAGAA CTCATCTGTT GATTGCTAGA 

aaccactttc rrcrrGTcrr g cuuia tg toggaactcc tagacaactg ttgaaacctc 

AATTCATTCC ATTTCA 

(2) INFORMATION FOR SEQ ID NO: 6: 

U) SEQUENCE CHARACTERISTICS: 

64 
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(A) LENGTH: 95 amino acids 
(B } TYPE: amino acid 
(D) TOPOLOGY: linear 

f.ii) MOLECULE ' TYPE : procein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

Met Lys Leu Leu Met Val Leu Met Leu Ala Ala Leu Leu Leu His Cy3 

-21 -20 .-15 -io 

Tyr Ala Asp Ser Gly Cys Lys Leu Leu Glu Asp Met Val Glu Lys Thr 
-5 15 10 

lie Asn Ser Asp lie Ser lie Pro Glu Tyr Lys Glu Leu Leu Gin Glu 
15 20 25 

Phe He Asp Ser Asp Ala Ala Ala Glu Ala Met Gly Lys Phe Lys Gin 
30 35 40 

Cys Phe Leu Asn Gin Ser His Arg Thr Leu Lys Asn Phe Gly Leu Met 
45 50 55 

Met His Thr Val Tyr Asp Ser He Trp Cys Asn Met Lys Ser Asn 
60 €5 70 

(2) INFORMATION FOR SEQ IO NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 24 base pairs 

(B) TYPE: nucleic acid 
fC) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 
CGCGCATGCT TGTCTGCCCA GCTG 24 
(2) INFORMATION POR SEQ ID NO: 8: 

( i ) SEQUENCE. CHARACTERISTICS : 

(A) LENGTH: 24 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 
ID) TOPOLOGY: linear 

(ii) MOLECULE TYPE : other nucleic acid 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO : 8 : 
CGCCCATGGA GTTCTGCCCA GCTC 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 
CGCGCATGCA CTGCTATGLA GATT 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNES3 : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
CGCAAGCTTC ATTTTTACAT GTCA 
(2) INFORMATION FOR SEQ ID NO: IX: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
CGCAAGCTTA GTTTTTACAT GTCA 
12) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
ID) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION. SEQ ID NO: 12 
CGCAAGCTTA CGCCTTGGGT AAAGTTA 
(2) INFORMATION FOR SEQ ID NO: 13: 

(il SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY* linear 

(ii) MOLECULE TYPE : other nucleic acid 
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(xi) SEQUENCE DESCRIPTION : SEQ ID NO : 1 3 
CCCGGATCCG CCATCATGAG GCTGTCAGTG TGTCT 
(2) INFORMATION FOR SEQ ID NO:14: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 
CGCGGATCCG CCATCATGAA GCTGTCGGTG 
i2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 33 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15 
CGCGGATCCG CCATCATGAA GCTGCTGATG GTC 
(2) INFORMATION POR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION SEQ ID NO; 16: 

CCCGGTACCT TTTTTTTTTT 11 1 I ' m 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 27 base pairs 
(B> TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
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CGCnGTACCA CTCCTTGGGT AAAGTTA 
,2) INFORMATION FOR SEQ ID NO - 9 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pait^ 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18 
CGCGGTAACA CGCCTTGGGT AAAGTTA 
(2) INFORMATION FOR SEQ ID NO:19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19 
CGCGGATCCA CCATGGTCTC GCTGGCCCTT 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20 
CGCGGATCCA CCATGAAGCT GTCGGTGTGT 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 
CGCGGATCCA CCATGAAGCT GCTGATGGTC 
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(2) INFORMATION POR SEQ ID NO: 22: 

U) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 
{C) STRAND ED NESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION; SEQ ID NO:22: 
CGCTCTAGAT CAAGCGTAGT CTGGGACGTC GTATGGGTAC ACACCACATT TTTT 54 
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 54 base pairs 

(B) TYPE : nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
CGCTCTAGAT CAAGCGTAGT CTGGGACGTC GTATGGGTAC ACACTACATT TCTT 54 
(2) INFORMATION FOR SEQ ID NO:24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 
CGCTCTAGAT CAAGCGTAGT CTGGGACGTC GTATGGGTAA TTACTCTTCA TATT 54 
(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 90 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

He Glu Leu Ser Leu Cys Leu Leu He Met Leu Ala Val Cys Cys Tyr 
1 5 10 15 

Glu Ala Asn Ala Ser Gin He Cys Glu Leu Val Ala His Glu Thr He 
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20 

Ser Phe Leu Met Lys Ser Glu Glu 
■ 35 40 

Tyr Asn Ala Pro Pro Ala Ala Val 

50 • 55 

Cys Val Asp Gin Met Ser Asn Gly 

65 * 70 

Leu Val Tyr tie Phe Leu Glu Cys 
85 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNBSS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 



25 30 

Glu Leu Lys Lys Glu Leu Glu Met 
45 

Glu Ala Lys Leu Glu Val Lys Arg 
60 

Asp Arg Leu val Val Ala Glu Thr 
75 ao 

Gly Val 
90 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

He Glu Leu Ser Leu Cys Leu Leu He Met Leu Ala Val Cys Cys Tyr 
X 5 10 XS 

Glu Ala Asn Ala Ser Gin lie Cys Glu Leu Val Ala His Glu Thr He 
20 25 3 

Ser Phe Leu Met Lys Ser Glu Glu Glu Leu Lys Lys Glu Leu Glu Met 
35 40 « 

Tvr Asn Ala Pro Pro Ala Ala Val Glu Ala Lys Leu Glu Val Lys Arg 
50 5S 60 

Cys val Asp Gin Met Ser Asn Gly Asp Arg Leu val Val Ala Glu Thr 
65 70 75 8 ° 

Leu Val Tyr He Phe Leu Glu Cys Gly Val 
B5 90 

(2) INFORMATION FOR SBQ ID NO: 27. 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 95 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNBSS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

Met Lys Leu Val Phe Leu Phe Leu Leu val Thr He Pro He Cys Cys 

1 5 10 15 

Tyr Ala Ser Gly Ser Gly Cys Ser He Leu Asp Glu Val He Arg Gly 
20 25 30 

Thr He Asn Ser Thr Val Thr Leu His Asp Tyr Met Lys Leu Val Lys 
35 *0 45 
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Pro Tyr Val Gin Ala His Phe Thr Glu «vs Ala Val Lys Gin Phe r.yii 

50 55 ' 60 

Gin Cys Phe t ; e.. A3p Gin Thr Asp L> 3 Thr Leu Clu Asn Val Gly Val 

65 70 75 80 

Met Met Glu Ala lie Phe Asn Ser Glu Ser Cys Gin Gin Pro Ser 
85 90 95 



\ 
\ 
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WHAT TS CLAIMBD IS : 

1. An isolated polynucleotide comprising a member selected from 

the group consisting of: 

(a) a polynucleotide having at least a 70% identity to a 
polynucleotide enpoding a polypeptide comprising amino acid 1 to 

69 of SEQ ID NO : 2; 

(b) a polynucleotide having at least a 70% identity to a 
polynucleotide encoding a polypeptide comprising amino acid 1 to 
amino acid 69 set forth in SEQ ID N0:4; 

(c) a polynucleotide having at least a 70% identity to a 
polynucleotide encoding a polypeptide comprising amino acid 1 to 
amino acid 74 set forth in SEQ ID NO: 6; 

(d) a polynucleotide which is complementary to the 
polynucleotide of (a) , (b) or (c) ; and 

(e) a polynucleotide comprising at least 15 bases of the 
polynucleotide of (a), (b) , (c) or (d) . 

2. The polynucleotide of Claim 1 wherein the polynucleotide is 
DNA. 

3 . The polynucleotide of Claim l wherein the polynucleotide is 
RNA. 

4 . The polynucleotide of Claim l wherein the polynucleotide is 
genomic DNA. 

5. The polynucleotide of Claim 2 which encodes the polypeptide 
comprising amino acid 1 to 69 of SEQ ID NO:2. 

6 . The polynucleotide of Claim 2 which encodes the polypeptide 
comprising amino acid 1 to 69 of SBQ ID NO: 4. 

7. The polynucleotide of Claim 2 which encodes the polypeptide 
comprising amino acid 1 to 74 of SEQ ID NO: 6. 

8. An isolated polynucleotide comprising a member selected from 
the group consisting of : 
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(a) a polynucleotide which jncodes a mature polypeptide having 
the amino acid sequence' expressed by the human cDNA contained in 
ATCC Deposit No. 97401; 

(b) a polynucleotide which encodes a mature polypeptide having 
the amino acid sequence expressed by the human cDNA contained in 
ATCC Deposit No. 97402; 

(c) a polynucleotide which encodes a mature polypeptide having 
the amino acid sequence expressed by the human cDNA contained in 
ATCC Deposit No. 97403; 

(d) . a polynucleotide which is complementary to the 
polynucleotide of (a) , (b) or (c) / and 

(e) a polynucleotide comprising at least 15 bases of the 
polynucleotide of (a) , (b) , (c) or (d) . 

9. The polynucleotide of claim 1 comprising nucleotide 106 to 
nucleotide 312 of SEQ ID NO:l. 

10. The polynucleotide of claim 1 comprising nucleotide 103 to 
nucleotide 309 of SEQ ID NO: 3. 

11. The polynucleotide of claim l comprising nucleotide 109 to 
nucleotide 330 of SEQ ID NO: 5. 

12. A vector comprising the DNA of Claim 2. 

13. A host cell comprising the vector of Claim 12. 

14. A process for producing a polypeptide comprising: expressing 
from the host cell of Claim 13 the polypeptide encoded by said DNA. 

15. A process for producing a cell which expresses a polypeptide 
comprising genetically engineering the cell with the vector of 
Claim 12 such that the cell expresses the polypeptide encoded by 
the huma cDNA in the vector. 

16. A polypeptide comprising a member selected from the group 
consisting of: 

(a) a polypeptide comprising amino acid 1 to 69 of SBQ ID NO:2; 

(b) a polypeptide comprising amino acid 1 to 69 of SEQ ID NO: 4,- 
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.<c) a oolypentide comprising amino acid 1 to 74 of SEQ ID NO: 6; 
(d) a polypeptide which is at least 70% identical to the 
polypeptide of (a) , (b) or (c) . 

17. The polypeptide of Claim 16 wherein the polypeptide comprises 
amino acid 1 to amino acid 69 of SEQ ID N0:2. 

18. The polypeptide of Claim 16 wherein the polypeptide comprises 
amino acid 1 to amino acid 69 of SEQ ID N0:4. 

19. The polypeptide of Claim 16 wherein the polypeptide comprises 
amino acid 1 to amino acid 74 of SEQ ID NO: 6. 

20. A compound which inhibits activation of the polypeptide of 
claim 16. 

21. A method for the treatment of a patient having need of hBSP 
I. II or III comprising: administering to the patient a 
therapeutically effective amount of the polypeptide of claim 16. 

22. The method of Claim 21 wherein said therapeutically effective 
amount of the polypeptide is administered by providing to the 
patient DNA encoding said polypeptide and expressing said 
polypeptide in vivo. 

23. A method for the treatment of a patient having need to inhibit 
a hESF I. II or III polypeptide comprising: administering to the 
patient a therapeutically effective amount of the compound of Claim 
20. 

24. A process for diagnosing a disease or a susceptibility to a 
disease related to an under -expression of the polypeptide of claim 
16 comprising: 

determining a mutation in a nucleic acid sequence encoding said 
polypeptide. 

25. A diagnostic process comprising: 

analyzing for the presence of the polypeptide of claim 16 in a 
sample derived from a host . 
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26. A method for identifying compounds which bind to and inhibit 
activation of the polypeptide of claim 16 comprising ; contacting 
a cell' expressing on the surface thereof a receptor for the 
polypeptide, said receptor being "^ociated with a second component 
capable of providing a detectable signal in response to the binding 
of a compound to said receptor, with an analytically detectable 
hESF I, II or III polypeptide and a compound under conditions to 
permit binding to the receptor; and 

determining whether the compound binds to and inhibits the 
receptor by detecting the absence of a signal generated from the 
interaction of the hESF I, II or III with the receptor. 
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Box I Observations where certain claims were found unsearchable (Coo tin » ''".n of item 1 of first sheet) 



Tins micmu; it >n:ii ha> not been established in respect of certain claims und.r Article I7(2)(a) lor the following reasons: 



□ 



Claims Nos . 

because they relate' to subject matter not required to be searched by this Authority, namely: 



2 □ 



Claims Nos.: 

because they relate to parts of the international application that do not comply with the prescribed requirements to such 
an extent that no meaningful international search can be carried out, specifically: 



3- Q Claims Nos.: 

because they are dependent claims and are not drafted in accordance with the second and third 



of Rule 6.4(a). 



Box II Observations where unity of invention is lacking (Continuation of item 2 of first sheet) 



This International Searching Authority found multiple inventions in this international application, as follows: 
Please See Extra Sheet. 



1 * As all required additional search fees were timely paid by the applicant, this international search report covers ail searchable 

claims. 

As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite payment 
of any additional fee. 

3 | X| As only some of the required additional search fees were timely paid by the applicant, this international search report covers 
only those claims fur which fees were paid, specifically claims Nos.: 
1-19 (group I) and species b-c 



4. Q No required additional search fees were timely paid by the applicant. Consequently, this international search report is 
restricted to the invention first mentioned in the claims; it is covered by c him* Nos.: 



Remark uu Protest Q T he additional search fees were accompanied by the applicant's protest. 

P*] No protest accompanied the payment of additional search fees. 
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A CLASSIFICATION OF SUBJECT MATTER: 
IPC (6): 

C12N 1720, 5/00, 15/00; C12P 21/06: COIN 33/53; C07K 1/00. 14/00. 21/04; A61K 33/00 

A. CLASSIFICATION OF Sl/BJECT MATTER: 
US CL : 

435/7.1. 69.1. 240.1. 252.3, 320.1; 530/ 300, 324, 350; 930/ 10; 536/23.5 

B. FIELDS SEARCHED 

Electronic data bases consulted (Name of data base and where practicable terms used): 

APS. USPAT. JPOABS; STN, MEDLINE, EMBASE, BIOSIS. CAPLUS, CONFSCl. DISSABS. J1CST-EPLUS, 
WPIDS, PATOSEP 

search terms; UB. uteroglobin, utcroglobin-like. PCR. degenerate PCR cloning. cclO, clara ceU 17 kDa protein, 
synonyms and authors 

BOX II. OBSERVATIONS WHERE UNITY OF INVENTION WAS LACKING 
This ISA found multiple inventions as follows: 



1. This International Search Authority has found 7 inventions 
claimed in the international Application covered by the claims 
indicated below: 

This application contains the following inventions or groups of 
inventions which are not so linked as to form a single inventive 
concept under PCT Rule 13.1. tn order for all inventions to be 
examined, the appropriate additional examination fees must be 
paid. 

Group I, claims 1-19, are drawn to an isolated polynucleotide 
encoding a polypeptide, a vector, host cell, process for 
producing a cell and method of recombinant^ expressing human 
endometrial 3 pec fie steroid-binding factor (hESF). 
Group I contains claims directed to more than one species of the 
generic invention. These species arc deemed to lack Unity of 
Invention because they are not so linked as to form a single 
inventive concept under PCT Rule 13.1. In order for more than 
one species to be examined, the appropriate additional 
examination fees must be paid. The species are as follows: 

a) the polynucleotide/AA encoding hESF Uclaims 5, 9, 17) 

b) the polynucleotide/AA encoding hESF 1 1, (claims 6, 10. 18) 

c) the polynucleotidc/AA encoding hESF II. (Claims 7, 11, t9)The following claims are generic: 1-4. 
species listed above do not relate to a single inventive 

concept under PCT Rule 13.1 because, under PCT Rule 13.2. the 
species lack the same or corresponding special technical 
features for the following reasons: 

ThenucclicacidcncodingtheAAofhESFJ of species a, the nucclic 
acid encoding the AA of hESF II of species b, and the nucelic 
acid encoding the AA of hESF III of species c, each have 
materially different chemical structures and materially different 
functional properties. These chemical structures and functional 
properties are the special technical features dial identify each 
individual species and distinguish each species from the others, 
because none ot the special technical features is stored by the 
Npccics listed above. 
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compound which inhibiU 



Oivup II. cUunuj 20. 23. i> drawn lo a 
activate of an hESF polypcpiiiic. 

Group III, ciaim(s> 21, is drawn to i mehtod for the treatment of 
a patient wiih hESF polypeptide. 

Group IV. claimfe) 22. is drawn to a method for the treatment of 
* patient with a therapeutic amount of an hESF polypeptide. 
Group V, claim(s) 24, is drawn to a process for diagnoaing 
diseases by determining hESF polypeptide expression. 
Group VI, claim(s) 25. is drawn to a diagnostic to analyze for 
the presence of an hESF polypeptide. 

?S^IL^T ( p ?' rV mWn !? * mClh0d ° f id ^y^^PO^ binding and mhibilmg activation of an 
hESFpolypcptidc.Each of Croup, II-VH contains claims directed to more than one 
species of die generic invention. These species are deemed to 
lack Unity of Invention because they arc not so linked as 
toformasingle inventive concept under PCT Rule 13. 1. 
Inorderformorcthan one species to be examined, the appropriate 
additional examination fees must be paid. The species are as 
follows: 

a) the poIynucleotidc/AA encoding hESF 1, 

b) the polynuclcotide/AA encoding hESF II. 

c) the polynucleotide/ AA encoding hESF (I, 

Th« species lutwi above do not relate to a tingle inventive 
concept under PCT Rule 13.1 because, under PCT Rule 13 2 the 
species lack the same or corresponding special technical features 

for the folio wing, reasons: The nucelic acid encoding the AA of \ 
hESF I of species a, the nucelic acid encoding the AA of hESF II 1 
of species b, and the nucelic acid encoding the AA of hESF (II of 
species c, each have materially different chemical structures and 
materially different functional properties. These chemical 
structures and functional properties are the special technical 
features that identify each individual species and distinguish 
each species from the others, because none of the special 

invention, lifted Groupi ,. VH L relltek} ? l31 ' 1X2 wd I3 - 3 > fc ' below:!!* 

invenuvc concept under PCT Rule 13. 1 becatae, under PCT Rule 

13.., they lack the same or corresponding special technical 

features for the following reason,: The polypeptide of group 
l.ihe inhaling compound of group II, the methods of treatment 
of group* III and IV. the diagnoaiticf of groupV and VI, and the 
method of identifying an hESF inhibitor of group VII, each have 
matenally different chemical structure, and materially different 
funct.onal properties. These chemical structures and functional 
properties are the spcc i a , lvchnicll i featurel ,„,, idemi , ^ 

invention and distinguish each invention from the others because 
■■«'■»« ul ihc special technical feature, is shared by the separate 

groups. ' 



Accordingly. 37 CFR 1.475(d) do* NOT provide for multiple 
products/method* within a single PCT application, and the claims 
are not so |,nkcd by a special technical feature within the 
meamng of ihc PCT Rule 13.2 so as to form a single inventive 

concept. 
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